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Gene des l-DesoxY-D-xylulose-Biosynthesewegs 

Die vorliegende Erfindung betrifft DNA-Sequenzen, die bei Inte. 
gration in das Genom von Viren, Eukaryonten und Prokaryonten 
die Isoprenoid-Biosynthese ver&ndern sowie gentechnologische 
Verfahren zur Hersteliung dieser transgenen Viren, Eukaryonten 
und Prokaryonten. Aufierdem betrifft sie Verfahren zur Identifi 
ziereung von Stoffen mit herbizider, antimikrobieller, antipa- 
rasitarer, antiviraler, fungizider, bakterizider Wirkung bei 
Pflanzen und antimikrobieller, antiparasitarer, antimykoti- 
scher, antibakterieller und antiviraler Wirkung bei Mensch und 
Tier. 

Der Biosyntheseweg zur Bildung von Isoprenoiden uber den klas- 
sischen Acetat/ Mevalonat-Weg und einen alternatives Mevalo- 
nat-unabh^ngigen Biosyntheseweg, den Desoxy-D-xylulose- 
Phosphat-Weg, ist bereits bekannt (Rohmer, M. , Knani, M., Simo- 
nin, P., Sutter, B., and Sahm, H. (1993): Biochem. J. 295: 517 
524) . 

Es ist aber nicht bekannt, wie und uber welche Wege in Viren, 
Eukaryonten und Prokaryonten eine Anderung der Isoprenoidkon- 
zentration uber den Desoxy-D-xylulose-Phoshat-Weg erreicht wer- 
den kann. In Fig. 1 ist dieser Biosyntheseweg dargestellt. 

Es werden daher DNA-Sequenzen zur Verfugung gestellt, die fur 
die l-Desoxy-D-xylulose-5-phosphat-Synthase (DOXP-Synthase) , : 
l-Desoxy-D-xylulose-5-phosphatreduktoisomerase (DOXP- 
Reduktoisomerase) oder das gcpE-Protein kodieren. Alle drei Ge- 
ne und Enzyme sind an der Isoprenoid-Biosynthese beteiligt. 

Das gcpE-Protein hat eine Kinasefunktion und katalysiert die 
Phosphorylierung eines Zuckers oder eines Phosphorzuckers oder 
einer Vorstufe der Isoprenoidbiosynthese, insbesondere die 
Phosphorylierung von 2-C-Methyl-D-erythritol, 2-C-Methyl-D- 
erythritol-phosphat, insbesondere 2-C-Methyl-D-erythritol-4~ 
phosphat, 2-OMethyl-D-erythrose, 2-OMethyl-D-erythrose- 
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phosphat, insbesondere 2-C-Methyl-D-erythrose-4-phosphat , In 
der Vorstufe der Isoprenoidsynthese katalysiert das gcpE- 
Protein insbesondere die Phosphorylierung der folgenden Sub- 
stanzen: 

CH 2 (OH) -C (CH 3 ) «C (OH) -CH 2 -0-PO (OH) 2 , CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -OH, 

CH 2 (OH) -CH (CH 3 ) -CO-CH 2 -0-PO (OH) 2 , CH 2 (OH) -CH (CH 3 ) -CO-CH 2 -OH 

CH 2 =C (CH 3 ) -CO-CH2-O-PO (OH) 2 , CH 2 =C (CH 3 ) -CO-CH 2 -OH, 

CH 2 =C (CH 3 ) -CH (OH) -CH2-O-PO (OH) 2/ CH 2 -C (CH 3 ) -CH (OH) -CH 2 -OH, 

CH 2 (OH) -C ( -CH 2 ) -C ( OH ) -CH 2 -0-PO (OH) 2 , CH 2 (OH) -C ( -CH 2 ) -C ( OH ) -CH 2 -OH 

CHO-CH (CH 3 ) -CH (OH) -CH 2 -0-PO (OH) 2 , CHO-CH (CH 3 ) -CH (OH) -CH 2 -OH, 

CH 2 (OH)-C(OH) (CH 3 )-CH=CH-0-PO(OH) 2r CH 2 (OH) -C (OH) (CH 3 ) -CH-CH-OH 

CH (OH) -C (CH 3 ) -CH (OH) -CH 2 -0-PO (OH) 2 , CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -OH, 

(CH 3 ) 2 HC-CO-CH 2 -0-PO (OH) 2 , (CH 3 ) 2 HC-CO-CH 2 -0-H, 

(CH 3 ) 2 HC-CH (OH) -CH 2 -0-PO (OH) 2/ (CH 3 ) 2 HC-CH (OH) -CH 2 -0-H. 

Die DOXP-Synthase katalysiert die Kondensation von Pyruvat und 
Glyceraldehyd-3-phosphat zu l-Deoxy-D-xylulose-5-phosphat und 
die DOXP-Reduktoisomerase katalysiert die Umwandlung von 1- 
Deoxy-D-xylulose-5-phosphat zu 2-C-Methyl-D-erythritol-4- 
phosphat* (siehe Fig. 1). 

Die Erfindung betrifft die folgenden DNA-Sequenzen: 
DNA-Sequenzen, die fiir ein Polypeptid mit der in SEQ ID NO: 2 
dargestellten Aminosauresequenz codieren oder fur ein Analoges 
oder Deri vat des Polypeptids gemaii SEQ ID NO: 2, worin eine oder 
mehrere Aminos&uren deletiert, hinzugefugt oder durch andere 
Aminosauren substituiert worden sind, 

DNA-Sequenzen, die f Or ein Polypeptid mit der in SEQ ID NO: 4 
dargestellten Aminosauresequenz codieren oder fur ein Analoges 
oder Derivat des Polypeptids gemaii SEQ ID NO: 4, worin eine oder 
mehrere Aminosauren deletiert, hinzugefugt oder durch andere 
AminosSuren substituiert worden sind, 

sowie DNA-Sequenzen, die fur ein Polypeptid mit der in SEQ ID 
NO: 6 dargestellten Aminosauresequenz codieren oder fur ein 
Analoges oder Derivat des Polypeptids gem&B SEQ ID NO: 6 , worin 
eine oder mehrere Aminosauren deletiert, hinzugefiagt oder durch 
andere Aminosauren substituiert worden sind. 
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Die Gene unci ihre Genprodukte (Polypeptide) sind im Sequ nzpro- 
tokoll mit ihrer Primarstruktur aufgeftthrt und haben folgende 
Zuordnung; 

SEQ ID NO:l: l-Desoxy-D-xylulose-5-phosphatreduktoisomerase-Gen 

SEQ ID NO: 2: l~Desoxy-D-xylulose-5-phosphatreduktoisomerase 

SEQ ID NO: 3: l-Desoxy-D-xylulose-5-phosphat-Synthase-Gen 

SEQ ID NO: 4: l-Desoxy~D-xylulose-5-phosphat-Synthase 

SEQ ID NO: 5: gcpE-Gen 

SEQ ID NO: 6 : gcpE-Proteine. 

Die DNA-Sequenzen stammen alle aus Plasmodium falciparum. 

Aufcer den im Sequenzprotokoll genannten DNA-Sequenzen sind auch 
solche geeignet, die infolge der Degeneration des genetischen 
Codes eine andere DNA-Sequenz besitzen, jedoch fur das'gleiche 
Polypeptid oder fur ein Analoges oder Derivat des Polypeptids 
kodieren, worin eine oder mehrere Aminosauren deletiert, hinzu- 
gefugt oder durch andere Aminosauren substituiert worden sind. 

Die erf indungsgema&en Sequenzen eignen sich fiir die Expression 
von Genen in Viren, Eukaryonten und Prokaryonten, die fur die 
Isoprenoid-Biosynthese des 1-Desoxy-D-xylulose-Wegs verantwort- 
lich sind. 

Erf indungsgemaft gehoren zu den Eukaryonten oder eukaryontischen 
Zellen tierischen Zellen, Pf lanzenzellen, Algen, Hefen, Pilze 
und zu den Prokaryonten oder prokaryontischen Bakterien Archae- 
bakterien und Eubakterien. 

Bei Integration einer DNA-Sequenz in ein Genom, auf der eine 
der oben angegebenen DNA-Sequenzen lokalisiert ist, wird die 
Expression der oben beschriebenen Gene in Viren, Eukaryonten 
und Prokaryonten ermoglicht. Die erf indungsgemaft transf ormier- 
ten Viren, Eukaryonten und Prokaryonten werden in an sich be- 
kannter Weise geziichtet und das wahrenddessen gebildete Isopre- 
noid isoliert und gegebenenfalls gereinigt. Nicht alle Isopre- 
noide mtissen isoliert werden, da die Isoprenoide in einigen 
Fallen direkt in die Raumluft abgegeben werden. 
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Die Erfindung betrifft ferner ein Verfahren zur Herstellung von 
transgenen Viren, Eukaryonten und Prokaryonten zur Veranderung 
des Isoprenoid-Gehaltes, das die folgenden Schritte enthSlt. 

a) Herstellung einer DNA-Sequenz mit folgenden Teilsequenzen 

i) Promotor, der in Viren, Eukaryonten und Prokaryonten 
aktiv ist und die Bildung einer RNA im vorgesehenen 
Zielgewebe Oder den Zielzellen sicherstellt, 

ii) DNA-Sequenz, die fiir ein Polypeptid mit der in SEQ ID 
NO: 2, 4 oder 6 dargestellten Aminosauresequenz codieren 
oder fiir ein Analoges oder Derivat des Polypeptids ge- 
mafi SEQ ID NO:2,4 Oder 6, 

iii) 5 s - und 3 % -nichttranslatierte Sequenz, die in Viren, 
Eukaryonten und Prokaryonten die Expression der be- 
zeichneten Gene ermoglichen oder verbessern, 

b) Transfer und Einbau der DNA-Sequenz in das Genom von Viren, 
prokaryontischen oder eukaryontischen Zellen mit oder ohne 
Verwendung eines Vektors (z.B. Plasmid, virale DNA) . 

Aus derart transf ormierten Pf lanzenzellen konnen die intakten 
ganzen Pflanzen regeneriert werden. 

Die fur die Proteine kodierenden Sequenzen mit den Nukleotidab- 
folgen Seq ID NO:l, Seq ID NO: 3 und Seq ID NO: 5 konnen mit ei- 
nem die Transkription in bestimmten Organen oder Zellen sicher- 
stellenden Promotor versehen werden, der in sense-Orientierung 
{3*-Ende des Promotors zum 5"-Ende der kodierenden Sequenz) an 
die Sequenz, die das zu bildende Protein kodiert, gekoppelt 
ist. An das 3 % -Ende der kodierenden Seqeunz wird ein die Termi- 
nation der mRNA-Synthese bestimmendes Terminationssignal ange- 
h£ngt. Urn das zu exprimierende Protein in bestimmte subzellula- 
re Kompartimente, wie Chloroplasten, Amyloplasten, Mitochondri- 
en, Vakuole, Cytosol oder Interzellularraume zu dirigieren, 
kann zwischen den Promotor und die kodierende Sequenz noch eine 
fiir eine sogenannte Signalsequenz oder ein Transitpeptid kodie- 
rende Sequenz gesetzt werden. In einigen Fallen ist es erfor- 
derlich, Sequenzen einzufiigen, die fiir eine Signalsequenz am 
COOH-Terminus des Proteins kodieren. Die Sequenz muG im glei- 



WO 00/17233 _ _ PCT/EP99/07055 

chen Leserahmen wie die kodierende Sequenz des Proteins sein. 
Zur Vorbereitung der Einftihrung der erf indungsgemSfien DNA- 
Sequenzen in hdhere Pflanzen sind eine grofie Anzahl von Klonie- 
rungsvektoren verfugbar, die ein Replikationssignal fiir E.coli 
und einen Marker beinhalten, der eine Selektion der transfor- 
mierten Zellen erlaubt. Je nach Einftihrungsmethode gewunschter 
Gene in die Pflanze kSnnen weitere DNA-Sequenzen erforderlich - 
sein. Werden zum Beispiel fur die Transformation der Pflanzen- 
zelle das Ti- oder Ri-Plasmid verwendet, so mufc mindestens eine 
rechte Begrenzung, haufig jedoch die rechte und die linke Be- 
grenzung der Ti- und Ri-Plasmid T-DNA als Flankenbereich den 
einzufiihrenden Genen eingeftigt werden. Die Verwendung von T-DNA 
fur die Transformation von Pf lanzenzellen ist intensiv unter- 
sucht und ausreichend in EP 120516; Hoekama, in: The Binary 
Plant Vector System, Of f set-drukkeri j Kanters B.V. Alblasserdam 
(1985), Chapter V; Fraley et al., Crit . Rev. Plant Sci. 3,1-46 
und An et al. (1985) EMBO J. 4, 277-287 beschrieben worden. Ist 
die eingefiihrte DNA einmal im Genom integriert, so ist sie in 
der Regel stabil und bleibt auch in den Nachkommen der ur- 
spriinglich transformierten Zellen erhalten. Sie erhalt norma- 
lerweise einen Selektionsmarker, der den transformierten Pflan- 
zenzellen Resistenz gegenuber einem Biozid oder einem Antibio- 
tikum, wie Kanamycin, G 418, Bleomycin, Hygromycin oder Phos- 
phinotricin u.a. vermittelt. Der individuell verwendete Marker 
sollte daher die Selektion transf ormierter Zellen gegenuber 
Zellen, denen die eingefiigte DNA fehlt, gestatten. 

Fur die Einfuhrung von DNA in eine Pflanze stehen viele Techni- 
ken zur Verfugung. Diese Techniken umfassen die Transformation 
mit Hilfe von Agrobakterien, z.B. Agrobacterium tumefaciens , 
die Fusion von Protoplasten, die Mikroinjektion von DNA, die 
Elekroporation, sowie ballistische Methoden und die Virusinfek- 
tion. Aus dem transformierten Pf lanzenmaterial konnen dann im 
geeigneten Medium, welches Antibiotika oder Biozide zur Selek- 
tion enthalten kann, wieder ganze Pflanzen regeneriert werden. 
Bei der Injektion und Elektroporation sind an sich keine spezi- 
ellen Anforderungen an die Plasmide gestellt. Sollen aber aus 
derartig transformierten Zellen ganze Pflanzen regeneriert wer- 
den, ist die Anwesenheit eines selektierbaren Markergens not- 
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wendig. Die transformierten Zellen wachsen innerhalb der Pflan- 
zen in der ttblichen Weise (McCormick et al. (1986), Plant Cell 
Reports 5, 81-84). Die Pflanzen konnen "normal angezogen werden 
und mit Pflanzen, die die gleiche transformierte Erbanlage oder 
andere Erbanlagen haben, gekreuzt werden. Die daraus entstehen- 
den Individuen haben die entsprechenden ph&notypischen Eigen- 
schaften, 

Weiterhin sind Gegenstand der Erfindung Expressionsvektoren, 
die eine oder mehrere der erf indungsgemSfien DNA-Sequenzen ent- 
halten. Solche Expressionsvektoren erhalt man, indem man die 
erfindungsgemafien DNA-Sequenzen mit geeigneten f unktionellen 
Regulationssignalen versieht. Solche Regulationssignale sind 
DNA-Sequenzen f die fur die Expression verantwortlich sind, bei- 
spielsweise Promotoren, Operatoren, Enhancer, ribosomale Bin- 
dungsstellen, und die vom Wirtsorganismus erkannt werden. 

Gegebenenfalls konnen noch weitere Regulationssignale, die bei- 
spielsweise Replikation oder Rekombination der rekombinanten 
DNA im Wirtsorganismus steuern, Bestandteil des Expressionsvek- 
tors sein. 

Ebenso gehoren die mit den erf indungsgemaJien DNA-Sequenzen oder 
Expressionsvektoren transformierten Wirtsorganismen zum Gegen- 
stand der Erfindung. 

Fur die Expression der erf indungsgemaften Enzyme eignen sich be- 
sonders solche Wirtszellen und Organismen, die keine intrinsi- 
schen Enzyme mit der Funktion der DOXP-Synthase, der DOXP- 
Reduktoisomerase oder des gcpE-Proteins aufweisen* Dies trifft 
fur Archaebacterien, Tiere, Pilze, Schleimpilze und einige Eu- 
bakterien zu. Durch das Fehlen dieser intrinsischen Enzymakti- 
vit&ten wird die Detektion und Aufreinigung der rekombinanten 
Enzyme wesentlich erleichtert. Auch wird es erst dadurch m6g- 
lich, mit geringem Aufwand die Aktivitat und insbesondere die 
Hemmung der Aktivitat der erf indungsgemaften rekombinanten Enzy- 
me durch verschiedenen Chemikalien und Pharmaka in Rohextrakten 
aus den Wirtszellen zu messen. 
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Die Expression der erfindungsgem&fien En2yme erfolgt vorteilhaf- 
terweise dann in eukaryontischen Zellen, wenn posttranslatori- 
sche Modif ikationen und eine native Faltung der Polypeptidkette 
erreicht werden soli. Aufierdem wird in Abhangigkeit vom Expres- 
sionssystem bei der Expression genomischer DNA-Sequenzen er- 
reicht, dafi Introns durch SpleiBen der DNA beseitigt und die 
Enzyme in der fUr die Parasiten charakteristischen Polypep- 
tidsequenz produziert werden- Fur Introns codierende Sequenzen 
konnen auch durch rekombinante DNA-Technologie aus den zu ex- 
primierenden DNA-Sequenzen beseitigt oder experimentell einge- 
fiigt werden. 

Die Isolierung des Proteins kann aus der Wirtszelle oder dem 
Kulturuberstand der Wirtszelle nach dem Fachmann bekannten Ver- 
fahren erfolgen. Es kann auch eine in vitro Reaktivierung der 
Enzyme erforderlich sein. 

Zur Erleichterung der Aufreinigung konnen die erf indungsgemaBen 
Enzyme oder Teilsequenzen der Enzyme als Fusionsprotein mit 
verschiedenen Peptidketten exprimiert werden. Dazu eigenen sich 
besonders Oligo-Histidin-Sequenzen und Sequenzen, die von der 
Glutathion-S-Transferase, Thioredoxin oder Calmodulin-bindenden 
Peptiden abgeleitet sind- 

Weiterhin konnen die erf indungsgemaBen Enzyme oder Teilsequen- 
zen der Enzyme als Fusionsprotein mit solchen, dem Fachmann be- 
kannten, Peptidketten exprimiert werden, daB die rekombinanten 
Enzyme in das extrazellul&re Millieu oder in bestimmte Kompar- 
timente der Wirtszellen transport iert werden. Dadurch kann so- 
wohl die Aufreinigung, als auch die Untersuchung der biologi- 
schen Aktivit&t der Enzyme erleichtert werden • 

Bei der Expression der erf indungsgemaBen Enzyme kann es sich 
als zweckmaBig erweisen, einzelne Codone zu verandern. Dabei 
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ist der gezielte Austausch von Basen in der kodierenden Region 
auch sinnvoll, wenn die genutzten Codone in den Parasiten ab- 
weichend sind von der Codonnutzung im heterologen Expressions- 
system, um eine optimale Synthese des Proteins zu gewahrlei- 
sten. 

Weiterhin konnen die erf indungsgemaiien Enzyme unter standardi- 
sierten Bedingungen durch dem Fachmann bekannte Techniken durch 
in vitro-Translation gewonnen werden. Dafiir geeignete Systeme 
sind Kaninchen-Reticulozyten- und Weizenkeimextrakte und Bakte- 
rienlysate. Auch kann in vitro transskribierte mRNA in Xenopus- 
Oocyten translatiert werden. 

Durch chemische Synthese kbnnen Oligo- und Polypeptide herge- 
stellt werden, deren Sequenzen aus der Peptidsequenz der erfin- 
dungsgemaflen Enzyme abgeleitet sind. Bei geeigneter Wahl der 
Sequenzen besitzen derartige Peptide Eigenschaf ten, die fur die 
erf indungsgemaBen Enzyme charakteristisch sind. Derartige Pep- 
tide konnen in groBen Mengen hergestellt werden und eignen sich 
besonders fur Studien uber die Kinetik der Enzymaktivitat , die 
Regulation der Enzymaktivitat, die dreidimensionale Struktur 
der Enzyme, die Hemmung der Enzymaktivitat durch verschiedenen 
Chemikalien und Pharmaka und die Bindungsgeometrie und Bindugn- 
saffinitat verschiedener Liganden. 

Vorzugsweise wird zur rekombinanten Herstellung der erfindungs- 
gemaBen Enzyme eine DNA mit den Nukleotiden aus den Sequenzen 
SEQ ID NO: 1, 3 und 5 verwendet. 

Die Erfindung umfafit daher auBerdem ein Verfahren zum Screening 
nach Verbindungen, die desDesoxy-D-xylulose-Phosphat- 
Stof fwechselweg inhibieren. GemaB diesem Verfahren wird ein 
Wirtsorganismus, der einen rekombinanten Expressionsvektor ent- 
halt, wobei der Vektor zumindest einen Teil der Olignukleo- 
tidsequenz gemaft SEQ ID N0:1, SEQ ID NO: 3 oder SEQ ID NO: 5 
oder Varianten oder Homologe dieser aufweist, und auBerdem eine 
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Verbindung, von der vermutet wird, dafi sie eine antimikrobiel- 
le, antiparasitare, antibakterielle, antivirale und antimykoti,- 
sche Wirkung bei Mensch und Tier oder eine antimikrobielle, an- 
tivirale, bakterizide, herbizide oder fungizide Wirkung bei 
Pflanzen hat f bereitgestellt . Ans.chliefiend wird der Wirtsorga- . 
nismus mit der Verbindung in Kontakt gebracht und die Wirksam- 
keit der Verbindung bestimmt. 

Ein weiterer Gegenstand dieser Erfindung sind Methoden zur Be- 
stimmung der enzymatische Aktivitat des gcpE-Proteins. Diese 
kann nach bekannten Verfahren bestimmt werden. Hierbei wird die 
Phosphorylierung eines Zuckers oder eines Phosphorzuckers oder 
einer Vorstufe der Isoprenoidbiosynthese, insbesondere die 
Phosphorylierung von 2-C-Methyl-D-erythritol, 2-C-Methyl-D- 
erythritol-phosphat, insbesondere 2-C-Methyl-D-erythritol-4- 
phosphat, 2-OMethyl-D-erythrose, 2-C-Methyl-D-erythrose- 
phosphat, insbesondere 2-C-Methyl-D-erythrose-4-phosphat , de- 
tektiert. Ein weiterer Gegenstand dieser Erfindung ist die Ver- 
wendung dieser Meliverf ahren zur Ermittlung von Stoffen, die die 
Aktivitat der jeweiligen Enzyme inhibieren. 

Die enzymatische Aktivitat von DOXP-Synthase und DOXP- 
Reduktisomerase kann in einem einzigen Schitt detektiert wer- 
den, indem die Umwandlung von Glycerinaldehyd-3-phosphat zu 2- 
C-Methylerythritol-4-phosphat bestimmt wird. 

Analog erfolgt die Bestimmung der Aktivitaten von DOXP-Synthase 
und DOXP-Reduktoisomerase. Fur die Bestimmung der DOXP- 
Synthase-Aktivit£t eignen sich auch f luorimetrische Verfahren, 
wie von Querol et al. beschrieben (Querol et al. Abstracts 4 th 
european symposium on plant isoprenoids, Barcelona 21-23 April 
1999) . 
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Patentanspriiche 

1. DNA-Sequenzen, die fiir ein Polypeptid mit der in SEQ ID NO: 
2 dargestellten Aminosauresequenz codieren oder fiir ein 
Analoges oder Derivat des Polypeptids gemaB SEQ ID NO: 2, 
worin eine oder mehrere Aminosauren deletiert, hinzugefiigt 
oder durch andere Aminos&uren substituiert worden sind. 

2. DNA-Sequenzen, die far ein Polypeptid mit der in SEQ ID NO: 
4 dargestellten Aminosauresequenz codieren oder f iir ein 
Analoges oder Derivat des Polypeptids gemaB SEQ ID NO: 4, 
worin eine oder mehrere Aminosauren deletiert, hinzugefiigt 
oder durch andere Aminosauren substituiert worden sind. 

3- DNA-Sequenzen, die fiir ein Polypeptid mit der in SEQ ID NO: 
6 dargestellten Aminosauresequenz codieren oder fiir ein 
Analoges oder Derivat des Polypeptids gemaB SEQ ID NO: 6, 
worin eine oder mehrere Aminosauren deletiert, hinzugefiigt 
oder durch andere Aminosauren substituiert worden sind. 

4. DNA-Sequenz gemaB einem der Anspriiche 1 bis 3, dadurch ge- 
kennzeichnet, daB sie aufierdem funktionelle Regulations- 
signale, insbesondere Promotoren, Operatoren, Enhancer, ri- 
bosomale Bindungsstellen, aufweist. 

5. DNA-Sequenz mit folgenden Teilsequenzen 

i) Promotor, der in Viren, Eukaryonten und Prokaryonten 
aktiv ist und die Bildung einer RNA im vorgesehenen 
Zielgewebe oder den Zielzellen sicherstellt, 

ii) DNA-Sequenzen gemSfi einem der Anspriiche 1 bis 3, 

iii) 3 *-nichttranslatierte Sequenz, die in Viren, Eukaryon- 
ten und Prokaryonten zur Addition von Poly-A Resten an 
das 3*-Ende der RNA ftthrt. 

6. Verfahren zur Herstellung von transgenen Viren, Eukaryonten 
und Prokaryonten zur Veranderung des Isoprenoid-Gehaltes, 
dadurch gekennzeichnet, daft eine DNA-Sequenz gemaB Anspruch 
4 oder 5 in das Genom von Viren, eukaryontischen und proka- 
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ryontischen Zellen mit oder ohne Verwenduhg eines Vektors 
transferiert und eingebaut wird. 

7. Transgene Systeme, insbesondere Pflanzen und Pflanzenzel- 
len, welche ein oder mehrere DNA-Sequenzen gemSft der An- 
spruche 1 bis 5 als „fremde n oder „zusatzliche" DNA enthal- 
ten f die exprimiert werden. 

8. Expressionsvektor, enthaltend eine oder mehrere DNA- 
Sequenzen gemafi Anspruch 1 bis 5. 

9. Protein, welches am l-Deoxy-D-Xylulose-5-Phosphat- 

Stof fwechselweges beteiligt ist und a) codiert wird von den 
DNA-Sequenzen SEQ ID NO: 1,3 oder 5 oder b) codiert wird 
von DNA-Sequenzen, die mit den DNA-Sequenzen SEQ ID NO: 
1,3,5 oder Fragmenten dieser DNA-Sequenzen im DNA-Bereich, 
der fur das reife Protein codiert, hybridisieren. 

10. Protein nach den Anspruch 9, erhSltlich aus den Kulturuber- 
standen von Parasiten oder aus den auf geschlossenen Parasi- 
ten und Aufreinigung liber chromatographische und elektro- 
phoretische Techniken. 

11. Protein nach einem der Anspriiche 9 und 10, dadurch gekenn- 
zeichnet, da& es a) das Produkt einer viralen, prokaryonti- 
schen oder eukaryontischen Expression einer exogenen DNA 
ist, b) codiert wird von den Sequenzen SEQ ID NO: 1, 3 oder 
5 oder codiert wird von DNA-Sequenzen, die mit den in den 
DNA-Sequenzen SEQ ID NO: 1, 3 oder 5 oder Fragmenten dieser 
DNA-Sequenzen im DNA-Bereich, der ftir das reife Protein ko- 
diert, hybridisieren, oder c) codiert wird von DNA- 
Sequenzen, die ohne Degeneration des genetischen Codes mit 
den in b) definierten Sequenzen hybridisieren wurden und 
fur ein Polypeptid mit entsprechender Aminosaure-Sequenz 
kodieren. 
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12. Protein gemaii einem der vorangehenden Ansprtichen, dadurch 
gekennzeichnet, dafi es die Aminosauresequenzen SEQ ID NO: 
2, 4 oder 6 aufweist. 

13. Verfahren zur Bestimmung der enzymatischen Aktivitat des 
gcpE-Proteins, dadurch gekennzeichnet, dafi Phosphorylierung 
eines Zuckers oder eines Phosphorzuckers oder einer Vorstu- 
fe der Isoprenoidbiosynthese, insbesondere die Phosphory- 
lierung von 2~C-Methyl-D-erythritol, 2-C-Methyl-D- 
erythritol-phosphat, insbesondere 2-C-Methyl-D-erythritol- 
4-phosphat, 2-C-Methyl-D-erythrose, 2-C-Methyl-D-erythrose- 
phosphat , insbesondere 2-C-Methyl-D-erythrose-4-phosphat , 
und der Phosphat- und Alkoholvorstuf en, detektiert wird. 

14. Verfahren nach Anspruch 13, dadurch gekennzeichnet, daii die 
Phosphorylierung der folgenden Phosphate oder Alkohole de- 
tektiert wird: 

CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -0-PO (OH) 2 , 
CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -OH, 

CH 2 (OH) -CH (CH 3 ) -CO-CH2-O-PO (OH) 2/ CH 2 (OH) -CH (CH 3 ) -CO-CH 2 -OH 

CH 2 =C (CH 3 ) -CO-CH 2 -0-PO (OH) 2 , CH 2 =C (CH 3 ) -CO-CH 2 -OH, 

CH 2 -C (CH 3 ) -CH (OH) -CH2-O-PO (OH) 2 , CH 2 -C (CH 3 ) -CH (OH) -CH 2 -OH, 

CH 2 (OH) -C <=CH 2 ) -C (OH) -CH 2 -0-PO (OH) 2 , 

CH 2 (OH) -C (=CH 2 ) -C (OH) -CH 2 -OH 

CHO-CH(CH 3 )-CH(OH)-CH 2 -0-PO(OH) 2 , CHO-CH (CH 3 ) -CH (OH) -CH 2 -OH, 

CH 2 (OH)-C(OH) (CH 3 )-CH=CH-0-PO(OH) 2 , 

CH 2 (OH)-C(OH) (CH 3 )-CH=CH-OH 

CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -0-PO (OH) 2 , 

CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -OH, 

(CH 3 ) 2 HC-CO-CH 2 -0-PO (OH) 2, 

(CH 3 ) 2 HC-CO-CH 2 -0~H, 

(CH 3 ) 2 HC-CH (OH) -CH 2 -0-PO (OH) 2 , 

(CH 3 ) 2 HC-CH (OH) -CH 2 -0-H. 

15. Verfahren zur gekoppelten Bestimmung der enzymatischen Ak- 
tivitat der DOXP-Synthase und der DOXP-Reduktase, dadurch 
gekennzeichnet, dafi die Umwandlung von Glycerinaldehyd-3- 
phosphat zu 2-C-Methylerythritol-4-phosphat detektiert 
wird. 
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16. Verfahren zum Screening einer Verbindung .fur die Therapie 
von infektiosen Prozessen bei Mensch und Tier, wobei das 
Verfahren umfafit: 

a) Bereitstellen einer Wirtszelle, die einen rekombinanten 
Expressions vektor enthalt, wobei der Vektor zumindest 
einen Teil der Olignukleotidsequenz gemafi SEQ ID NO:l, - 
SEQ ID NO: 3 oder SEQ ID NO: 5 oder Varianten Oder Ana- 
loga dieser aufweist, und aulierdem eine Verbindung, von 
der vermutet wird, dafi sie eine antimykotische, antibio- 
tische, antiparasitare oder antivirale Wirkung bei 
Mensch und Tier hat, 

b) In-Kontakt-Bringen der Wirtszelle mit der Verbindung und 

c) Bestimmung der antimikrobiellen, antimykotischen, anti- 
biotischen, antiparasit£ren oder antiviralen Wirksamkeit 
der Verbindung. 

17 . Verfahren zum Screening nach Verbindungen zur Behandlung 
von Pflanzen, wobei das Verfahren umfaBt: 

a) Bereitstellen einer Wirtszelle, die einen rekombinanten 
Expressionsvektor enthalt, wobei der Vektor zumindest 
einen Teil der Olignukleotidsequenz gemafi SEQ ID NO:l, 
SEQ ID NO: 3 oder SEQ ID NO: 5 oder Varianten oder Ana- 
loga dieser aufweist, und aufierdem eine Verbindung, von 
der vermutet wird, da£ sie eine antimikrobielle, antivi- 
rale, antiparasitare, bakterizide, fungizide oder herbi- 
zide Wirkung bei Pflanzen hat, 

b) In-Kontakt-Bringen der Wirtszelle mit der Verbindung und 

c) Bestimmung der antimikrobiellen, antiviralen, antipara- 
sitaren, bakteriziden, fungiziden oder herbiziden Wirk- 
samkeit der Verbindung. 

18. Verwendung von DNA nach einem der Anspruche 1 bis 5 oder 
von Proteinen nach einem der Anspriiche 9 bis 12 oder von 
transgenen Systemen nach Anspruch 7 zur Vorbeugung oder 
Therapie von Erkrankungen bei Mensch und Tier. 
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<110> Jomaa, Hassan 

<120> Gene des 1-Desoxy-D-xylulose-Biosynthesewegs 

<130> 15696 

<140> PCT/EP99 
<141> 1999-09-22 

<150> DE19923567.8 
<151> 1999-05-22 

<150> DE19843279.8 
<151> 1998-09-22 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1467 
<212> DNA 

<213> Plasmodium falciparum 

<220> 

<221> CDS 

<222> (1) . . (1467) 

<220> 

<221> gene 

<222> (1) . . (1467) 

<220> 

<221> raRNA 

<222> (1) . . (1467) 

<400> 1 

arg aag aaa tat att tat ata tat ttt ttc ttc ate aca ata act att 48 

Met Lys Lys Tyr lie Tyr lie Tyr Phe Phe Phe lie Thr He Thr He 
15 10 15 
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aat gat tta gta ata aat aat aca tea aaa tgt gtt tec att gaa aga 96 
Asn Asp Leu Val lie Asn Asn Thr Ser Lys Cys Val Ser lie Glu Arg 
20 25 30 

aga aaa aat aac gca tat ata aat tat ggt ata gga tat aat gga cca 144 
Arg Lys Asn Asn Ala Tyr lie Asn Tyr Gly lie Gly Tyr Asn Gly Pro 
35 40 45 

gat aat aaa ata aca aag agt aga aga tgt aaa aga ata aag tta tgc 192 
Asp Asn Lys lie Thr Lys Ser Arg Arg Cys Lys Arg lie Lys Leu Cys 
50 55 60 

aaa aag gat tta ata gat att ggt gca ata aag aaa cca att aat gta 240 
Lys Lys Asp Leu lie Asp lie Gly Ala lie Lys Lys Pro lie Asn Val 
65 70 75 80 

gca att ttt gga agt act ggt agr. ata ggt acg aat get tta aat ata 288 
Ala lie Phe Gly Ser Thr Gly Ser lie Gly Thr Asn Ala Leu Asn lie 
85 90 95 

ata agg gag tgt aat aaa att gaa aat gtt ttt aat gtt aaa gca ttg 336 
He Arg Glu Cys Asn Lys He Glu Asn Val Phe Asn Val Lys Ala Leu 
100 105 110 

tat gtg aat aag agt gtg aat gaa tta tat gaa caa get aga gaa ttt 384 
Tyr Val Asn Lys Ser Val Asn Glu Leu Tyr Glu Gin Ala Arg Glu Phe 
115 120 125 

tta cca gaa tat ttg tgt ata cat gat aaa agt gta tat gaa gaa tta 432 
Leu Pro Glu Tyr Leu Cys He His Asp Lys Ser Val Tyr Glu Glu Leu 
130 135 140 

aaa gaa ctg gta aaa aat ata aaa gat tat aaa cct ata ata ttg tgt 480 
Lys Glu Leu Val Lys Asn He Lys Asp Tyr Lys Pro He He Leu Cys 
145 150 155 160 

ggt gat gaa ggg atg aaa gaa ata tgt agt agt aat agt ata gat aaa 528 
Gly Asp Glu Gly Met Lys Glu He Cys Ser Ser Asn Ser He Asp Lys 
165 170 175 

ata gtt att ggt att gat tct ttt caa gga tta tat tct act atg tat 576 
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He Val He Gly He Asp S r Phe Gin Gly Leu Tyr Ser Thr Met Tyr 
180 185 190 



gca att atg aat aat aaa ata gtt gcg tta get aat aaa gaa tec att 624 
Ala lie Met Asn Asn Lys lie Val Ala Leu Ala Asn Lys Glu Ser He 
195 200 205 

gtc tct get ggt ttc ttt tta aag aaa tta tta aat att cat aaa aat 672 
Val Ser Ala Gly Phe Phe Leu Lys Lys Leu Leu Asn lie His Lys Asn 
210 215 220 

gca aag ata ata cct gtt gat tea gaa cat agt . get ata ttt caa tgt 720 
Ala Lys lie lie Pro Val Asp Ser Glu His Ser Ala lie Phe Gin Cys 
225 230 235 240 

tta gat aat aat aag gta tta aaa aca aaa tgt tta caa gac aat ttt 768 
Leu Asp Asn Asn Lys Val Leu Lys Thr Lys Cys Leu Gin Asp Asn Phe 
245 250 255 

tct aaa att aac aat ata aat aaa ata ttt tta tgt tea tct gga ggt • 816 
Ser Lys He Asn Asn lie Asn Lys He Phe Leu Cys Ser Ser Gly 6ly 
260 265 270 

cca ttt caa aat tta act atg gac gaa tta aaa aat gta aca tea gaa 864 
Pro Phe Gin Asn Leu Thr Met Asp Glu Leu Lys Asn Val Thr Ser Glu 
275 280 285 

aat get tta aag cat cct aaa tgg aaa atg ggt aag aaa ata act ata 912 
Asn Ala Leu Lys His Pro Lys Trp Lys Met Gly Lys Lys lie Thr lie 
290 295 300 

gat tct gca act atg atg aat aaa ggt tta gag gtt ata gaa acc cat 960 
Asp Ser Ala Thr Met Met Asn Lys Gly Leu Glu Val lie Glu Thr His 
305 310 315 320 

ttt tta ttt gat gta gat tat aat gat ata gaa gtt ata gta cat aaa 1008 
Phe Leu Phe Asp Val Asp Tyr Asn Asp lie Glu Val lie Val His Lys 
325 330 335 

gaa tgc att ata cat tct tgt gtt gaa ttt ata gac aaa tea gta ata 1056 
Glu Cys Ile s lie His Ser Cys Val Glu Phe He Asp Lys Ser Val He 
340 345 350 
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agt caa atg tat tat cca gat atg caa ata ccc ata tta tat tct tta 1104 

Ser Gin Met Tyr Tyr Pro Asp Met Gin He Pro He Leu Tyr Ser Leu 
355 360 365 

aca tgg cct gat aga ata aaa aca aat tta aaa cct tta gat ttg get 1152 

Thr Trp Pro Asp Arg He Lys Thr Asn Leu Lys Pro Leu Asp Leu Ala 
370 375 380 

cag gtt tea act ctt aca ttt cat aaa cct tct tta gaa cat ttc ccg 1200 

Gin Val Ser Thr Leu Thr Phe His Lys Pro Ser Leu Glu His Phe Pro 
235 390 395 400 

tgt att aaa tta get tat caa gca ggt ata aaa gga aac ttt tat cca 1248 

Cys lie Lys Leu Ala Tyr Gin Ala Gly He Lys Gly Asn Phe Tyr Pro 
405 410 415 

acz gta eta aat gcg tea aat gaa ata get aac aac tta ttt ttg aat 1296 

Thr Val Leu Asn Ala Ser Asn Glu He Ala Asn Asn Leu Phe Leu Asn 
420 425 430 

aat aaa att aaa tat ttt gat att tec tct ata ata teg caa gtt ctt 1344 

Asn Lys He Lys Tyr Phe Asp He Ser Ser He He Ser Gin Val Leu 
435 440 445 

gaa ret ttc aat tct caa aag gtt teg gaa aat agt gaa gat tta atg 1392 

Glu Ser Phe Asn Ser Gin Lys Val Ser Glu Asn Ser Glu Asp Leu Met 
450 455 460 

aag caa att eta caa ata cat tct tgg gec aaa gat aaa get acc gat 1440 

Lys Gin lie Leu Gin lie His Ser Trp Ala Lys Asp Lys Ala Thr Asp 
465 470 475 480 

ata tac aac aaa cat aat tct tea tag 1467 

lie Tyr Asn Lys His Asn Ser Ser 
485 



<210> 2 
<211> 488 
<212> PRT 

<213> Plasmodium falciparum 
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<400> 2 

Met Lys Lys Tyr lie Tyr lie Tyr Phe Phe Phe He Thr He Thr He 
15 10 15 

Asn Asp Leu V'al He Asn Asn Thr Ser Lys Cys Val Ser He Glu Arg 
20 25 30 

Arg Lys Asn Asn Ala Tyr lie Asn Tyr Gly He Gly Tyr Asn Gly Pro 
35 40 45 

Asp Asn Lys He Thr Lys Ser Ar.g Arg Cys Lys Arg He Lys Leu Cys 
50 55 60 

Lys Lys Asp Leu He Asp He Gly Ala He Lys Lys Pro lie Asn Val 
65 70 75 80 

Ala lie Phe Gly Ser Thr Gly Ser He Gly Thr Asn Ala Leu Asn lie 
85 90 95 

He Arg Glu Cys Asn Lys He Glu Asn Val Phe Asn Val Lys Ala Leu 
100 105 110 

Tyr Val Asn Lys Ser Val Asn Glu Leu Tyr Glu Gin Ala Arg Glu Phe 
115 120 125 

Leu Pro Glu Tyr Leu Cys lie His Asp Lys Ser Val Tyr Glu Glu Leu 
130 135 140 

Lys Glu Leu Val Lys Asn lie Lys Asp Tyr Lys Pro lie lie Leu Cys 
145 150 155 160 

Gly Asp Glu Gly Met Lys Glu He Cys Ser Ser Asn Ser lie Asp Lys 
165 170 175 

lie Val He Gly lie Asp Ser Phe Gin Gly Leu Tyr Ser Thr Met Tyr 
180 185 190 

Ala He Met Asn Asn Lys He Val Ala Leu Ala Asn Lys Glu Ser He 
195 200 205 

Val Ser Ala Gly Phe Phe Leu Lys Lys Leu Leu Asn lie His Lys Asn 
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210 215 220 



Ala Lys lie lie Pro Val Asp Ser Glu His Ser Ala lie Phe Gin Cys 
225 230 235 240 

Leu Asp Asn Asn Lys Val Leu Lys Thr Lys Cys Leu Gin Asp Asn Phe 
245 250 255 

Ser Lys lie Asn Asn He Asn Lys He Phe Leu Cys Ser Ser Gly Gly 
260 265 270 

Fro Phe Gin Asn Leu Thr Met Asp Glu Leu Lys Asn Val Thr Ser Glu 
275 280 . 285 

Asn Ala Leu Lys His Pro Lys Trp Lys Met Gly Lys Lys He Thr He 
290 295 300 

Asp Ser Ala Thr Met Met Asn Lys Gly Leu Glu Val He Glu Thr His 
305 310 315 320 

Phe Leu Phe Asp Val Asp Tyr Asn Asp He Glu Val He Val His Lys 
325 330 335 

Glu Cys He He His Ser Cys Val Glu Phe lie Asp Lys Ser Val He 
340 345 350 

Ser Gin Met Tyr Tyr Pro Asp Met Gin He Pro He Leu Tyr Ser Leu 
355 360 365 

Thr Trp Pro Asp Arg lie Lys Thr Asn Leu Lys Pro Leu Asp Leu Ala 
370 375 380 

Gin Val Ser Thr Leu Thr Phe His Lys Pro Ser Leu Glu His Phe Pro 
385 390 395 400 

Cys He Lys Leu Ala Tyr Gin Ala Gly He Lys Gly Asn Phe Tyr Pro 
405 410 415 

Thr Val Leu Asn Ala Ser Asn Glu He Ala Asn Asn Leu Phe Leu Asn 
420 425 430 

Asn Lys He Lys Tyr Phe Asp He Ser Ser He He Ser Gin Val Leu 
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435 440 445 

Glu Ser Phe Asn Ser Gin Lys Val Ser Glu Asn Ser Glu Asp Leu Met 
450 455 460 

Lys Gin lie Leu Gin lie His Ser Trp Ala Lys Asp Lys Ala Thr Asp 
465 470 475 480 

lie Tyr Asn Lys His Asn Ser Ser 
485 



<210> 3 
<211> 3872 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
<221> CDS 

<222> (126) . . (3740) 
<220> 

<221> gene 

<222> (1) . . (3870) 

<220> 

<221> mRNA 

<222> (1) . . (3870) 

<AOO> 3 

ggtaatatac gtataatata tatataatat attcttacgt atgtatcatt tatgaatcat 60 

aataatattc taaatttacc ttccgttttt gctcgatctt ctcattttcg tttcagcttt 120 

tatca atg att ttt aat tat gtg ttt ttt aag aac ttt gta cca gtt gtt 170 
Met lie Phe Asn Tyr Val Phe Phe Lys Asn Phe Val Pro Val Val 
1 5 10 15 

eta tac att etc ctt ata ata tat att aac tta aat ggc atg aat aat 218 
Leu Tyr lie Leu Leu lie He Tyr lie Asn Leu Asn Gly Met Asn Asn 
20 25 30 
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aaa aat caa ata aaa aca gaa aaa att tat ata aag aaa ttg aat agg 266 
Lys Asn Gin lie Lys Thr Glu Lys lie Tyr lie Lys Lys Leu Asn Arg 

35 40 ' 45 

ttg tea agg aaa aat teg tta tgt agt tct aaa aat aaa ata gca tgc 314 
Leu Ser Arg Lys Asn Ser Leu Cys Ser Ser Lys Asn Lys lie Ala Cys 
50 55 60 

ttg ttc gat ata gga aat gat gat aat aga aat acg aca tat ggc tat 362 
Leu Phe Asp lie Gly Asn Asp Asp Asn Arg Asn Thr Thr Tyr Gly Tyr 
65 70 75 

aat gtg aat gtt aaa aat gat gat att aat tec tta eta aaa aat aat 410 
Asn Val Asn Val Lys Asn Asp Asp He Asn Ser Leu Leu Lys Asn Asn 
80 85 90 95 

tat agt aat aaa ttg tac atg gat aag agg aaa aat att aat aat gta 458 
Tyr Ser Asn Lys Leu Tyr Met Asp Lys Arg Lys Asn He Asn Asn Val 
100 105 110 

att agt act aat aaa ata tct ggg tec att tea aat att tgt agt aga 506 
He Ser Thr Asn Lys He Ser Gly Ser He Ser Asn He Cys Ser Arg 
115 120 125 

aat caa aaa gaa aat gaa caa aaa aga aat aaa caa aga tgt tta act 554 
Asn Gin Lys Glu Asn Glu Gin Lys Arg Asn Lys Gin Arg Cys Leu Thr 
130 135 140 

caa tgt cac act tat aat atg tea cat gaa cag gac aaa eta get aat 602 
Gin Cys His Thr Tyr Asn Met Ser His Glu Gin Asp Lys Leu Ala Asn 
145 150 155 

gat aat aat agg aat aat aaa aag aat ttt aat tta tta ttt ata aat 650 
Asp Asn Asn Arg Asn Asn Lys Lys Asn Phe Asn Leu Leu Phe He Asn 
160 165 170 175 

tat ttt aat ttg aaa cga atg aaa aat tct ctt eta aat aaa gac aat 698 
Tyr Phe Asn Leu Lys Arg Met Lys Asn Ser Leu Leu Asn Lys Asp Asn 
180 185 190 

tec ttt tac tgt aaa gaa aaa aaa ttg tea ttt ctg cat aag gee tat 746 
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Phe Phe Tyr Cys Lys Glu Lys Lys Leu Ser Phe Leu His Lys Ala Tyr 
195 200 205 

aaa aaa aaa aat tgc act ttt caa aat tat agt tta aaa aga aaa tct 794 

Lys Lys Lys Asn Cys Thr Phe Gin Asn Tyr Ser Leu Lys Arg Lys Ser 
210 215 220 

aat cgt gat tea cat aaa ttg ttt tct gga gaa ttt gac gat tat aca 842 

Asn Arg Asp Ser His Lys Leu Phe Ser Gly Glu Phe Asp Asp Tyr Thr 
225 230 235 

aat aat aat get tta tat gaa tec gaa aaa aaa gaa tac att aca eta 890 

Asn Asn Asn Ala Leu Tyr Glu Ser Glu Lys Lys Giu Tyr lie Thr Leu 

240 245 250 255 

aat aat aat aat aaa aat aat aat aat aaa aat aat gat aat aaa aat 938 

Asn Asn Asn Asn Lys Asn Asn Asn Asn Lys Asn Asn Asp Asn Lys Asn 

260 265 270 

aat gat aat aat gat tat aat aat aat aat agt tgt aat aat tta gga 986 

Asn Asp Asn Asn Asp Tyr Asn Asn Asn Asn Ser Cys Asn Asn Leu Gly 
275 280 285 

gag aga tec aat cat tat gat aat tat ggt gga gat aat aat aat cca 1034 

Glu Arg Ser Asn His Tyr Asp Asn Tyr Gly Gly Asp Asn Asn Asn Pro 
290 295 300 

tgt aat aat aat aat gac aaa tat gat ata gga aaa tat ttc aaa cag 1082 

Cys Asn Asn Asn Asn Asp Lys Tyr Asp lie Gly Lys Tyr Phe Lys Gin 
305 310 315 

att aat acc ttt att aat att gat gaa tat aaa act ata tat ggt gat 1130 

He Asn Thr Phe He Asn He Asp Glu Tyr Lys Thr He Tyr Gly Asp 

320 325 330 335 

gaa ata tat aaa gaa ata tat gaa eta tat gta gaa aga aat att cct 1178 

Glu He Tyr Lys Glu He Tyr Glu Leu Tyr Val Glu Arg Asn He Pro 

340 345 350 

gaa tat tat gaa cga aaa tat ttt tea gaa gat att aaa aag agt gtc 1226 

Glu Tyr Tyr Glu Arg Lys Tyr Phe Ser Glu Asp He Lys Lys Ser Val 
355 360 365 
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eta ttt gat ata gat aaa tat aat gat gtc gaa ttt gaa aaa get ata 1274 
Leu Phe Asp lie Asp Lys Tyr Asn Asp Val Glu Phe Glu Lys Ala lie 
370 375 380 

aaa gaa gaa ttt ata aat aat gga gtt tat att aat aat ata gat aat 1322 
Lys Glu Glu Phe lie Asn Asn Gly Val Tyr lie Asn Asn lie Asp Asn 
385 390 395 

aca tat tat aaa aaa gaa aat att tta ata atg aaa aag ata tta cat 1370 
Thr Tyr Tyr Lys Lys Glu Asn lie Leu lie Met Lys Lys lie Leu His 
400 405 410 415 

tat ttc cca eta tta aaa tta att aat aat cca tea gat tta aaa aag 1418 
Tyr Phe Pro Leu Leu Lys Leu lie Asn Asn Pro Ser Asp Leu Lys Lys 
420 425 430 

tta aaa aaa caa tat tta cct tta tta gca cat gaa tta aaa ata ttt 1466 
Leu Lys Lys Gin Tyr Leu Pro Leu Leu Ala His Glu Leu Lys lie Phe 
435 440 445 

tta ttt ttt att gta aat ata aca gga ggt cat ttt tec tct gtt tta 1514 
Leu Phe Phe lie Val Asn lie Thr Gly Gly His Phe Ser Ser Val Leu 
450 455 460 

age tct tta gaa att caa tta tta tta ttg tat att ttt aat caa cca 1562 
Ser Ser Leu Glu lie Gin Leu Leu Leu Leu Tyr lie Phe Asn Gin Pro 
465 470 475 

tat gat aat gtt ata tat gat ata gga cat caa gca tat gta cat aag 1610 
Tyr Asp Asn Val He Tyr Asp He Gly His Gin Ala Tyr Val His Lys 
480 485 490 495 



1658 



ata ttg acc gga aga aaa eta tta ttt eta tea tta aga aat aaa aaa 
He Leu Thr Gly Arg Lys Leu Leu Phe Leu Ser Leu Arg Asn Lys Lys 
500 505 510 



ggt att agt gga ttc eta aat att ttt gaa agt att tat gat aaa ttt 1706 
Gly He Ser Gly Phe Leu Asn He Phe Glu Ser lie Tyr Asp Lys Phe 
515 520 525 



ggg get ggt cac agt tec act tea tta agt get ata caa gga tat tat 



1754 
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530 

gaa gcc gag tgg caa gtg 
Glu Ala Glu Trp Gin Val 
545 
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Thr Ser Leu Ser Ala lie Gin 
535 540 

aag aat aaa gaa aaa tat gga 
Lys Asn Lys Glu Lys Tyr Gly 
550 555 



PCT/EP99/07055 

Gly Tyr Tyr 

aat gga gat 1802 
Asn Gly Asp 



ata gaa ata agt gat aac gca aat gtc acg aat aat gaa agg ata ttt 1850 
lie Glu lie Ser Asp Asn Ala Asn Val Thr Asn Asn Glu Arg lie Phe 
560 565 570 575 



caa aaa gga ata cac aat gat aat aat att aac aat aat att aat aat 1898 
Gin Lys Gly lie His Asn Asp Asn Asn lie Asn Asn Asn lie Asn Asn 
580 585 590 



aat aat tat ate aat cct tea gat gtg gta gga aga gaa aat acg aat 194 6 
Asn Asn Tyr He Asn Pro Ser Asp Val Val Gly Arg Glu Asn Thr Asn 
595 600 605 



gta cca aat gta cga aat gat aac cat aac gtg gat aaa gta cac att 1994 
Val Pro Asn Val Arg Asn Asp Asn His Asn Val Asp Lys Val His He 
610 615 620 



get att ata gga gat ggt ggt tta aca ggt gga atg gca tta gaa gcg 2042 
Ala He He Gly Asp Gly Gly Leu Thr Gly Gly Met Ala Leu Glu Ala 
625 630 635 



tta aat tat att tea ttc ttg aat 
Leu Asn Tyr He Ser Phe Leu Asn 
640 645 

gat aac gga caa gtt tct tta cca 
Asp Asn Gly Gin Val Ser Leu Pro 
660 



tct aaa att tta att att tat aat 2090 

Ser Lys He Leu He lie Tyr Asn 
650 655 

aca aat gcc gta agt ata tea ggt 2138 

Thr Asn Ala Val Ser lie Ser Gly 

665 670 



aat aga cct ata ggt tct ata tea gat cat tta cat tat ttt gtt tct 2186 
Asn Arg Pro He Gly Ser He Ser Asp His Leu His Tyr Phe Val Ser ' 
675 680 685 



aat ata gaa gca aat get ggt gat aat aaa tta teg aaa aat gca aaa 2234 
Asn He Glu Ala Asn Ala Gly Asp Asn Lys Leu Ser Lys Asn Ala Lys 
690 695 700 
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gag aat aac att ttt gaa aat ttg aat tat gat tat att ggt <jtt gtg 2282 
Glu Asn Asn lie Phe Glu Asn Leu Asn Tyr Asp Tyr lie Gly Val Val 
705 710 715 

aat ggt aat aat aca gaa gag etc ttt aaa gta tta aat aat ata aaa 2330 
Asn Gly Asn Asn Thr Glu Glu Leu Phe Lys Val Leu Asn Asn lie Lys 
720 725 730 735 

gaa aat aaa tta aaa aga get act gtt ctt cat gta cgt aca aaa aaa 2378 
Glu Asn Lys Leu Lys Arg Ala Thr Val Leu His Val Arg Thr Lys Lys 
740 745 750 

teg aat gat ttt ata aat tea aag agt cca ata agt ata ttg cac tct 2426 
Ser Asn Asp Phe He Asn Ser Lys Ser Pro He Ser He Leu His Ser 
755 760 765 

ata aag aaa aat gag att ttc cct ttc gat ace act ata tta aat gga 2474 
He Lys Lys Asn Glu He Phe Pro Phe Asp Thr Thr He Leu Asn Gly 
770 775 780 

aat att cat aag gag aac aag ata gaa gaa gag aaa aat gtg tct tea 2522 
Asn He His Lys Glu Asn Lys He Glu Glu Glu Lys Asn Val Ser Ser 
785 790 795 

tct aca aag tat gat gta aat aat aag aat aat aaa aat aat gat aat 2570 
Ser Thr Lys Tyr Asp Val Asn Asn Lys Asn Asn Lys Asn Asn Asp Asn 
800 805 810 815 

agt gaa att ata aaa tat gaa gat atg ttt tea aaa gag acg ttc aca 2618 
Ser Glu He lie Lys Tyr Glu Asp Met Phe Ser Lys Glu Thr Phe Thr 
820 825 830 

gat ata tat aca aat gaa atg tta aaa tat tta aag aaa gat aga aat 2666 
Asp He Tyr Thr Asn Glu Met Leu Lys Tyr Leu Lys Lys Asp Arg Asn 
835 840 845 

ata ata ttc eta tct ccc get atg tta gga gga tea gga ttg gtt aaa 2714 
lie lie Phe Leu Ser Pro Ala Met Leu Gly Gly Ser Gly Leu Val Lys 
850 855 860 

att agt gag cgt tat cca aat aat gta tat gat gta ggt ata gca gaa 2762 
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lie Ser Glu Arg Tyr Pro Asn Asn Val Tyr Asp Val Gly lie Ala Glu 
865 870 875 

caa cat tct gta act ttc gca gca get atg gca atg aat aag aaa tta 2810 
Gin His Ser Val Thr Phe Ala Ala Ala Met Ala Met Asn Lys Lys Leu 
880 885 890 895 

aaa ata caa tta tgt ata tat teg ace ttt tta caa aga gca tat gat 2858 
Lys lie Gin Leu Cys lie Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp 
900 905 910 

caa att ata cat gat ctt aat tta caa aat ata cct tta aag gtt ata 2906 
Gin lie He His Asp Leu Asn Leu Gin Asn He Pro Leu Lys Val He 
915 920 925 

att gga aga agt gga tta gta gga gag gat ggg gca aca cat caa ggt 2954 
He Gly Arg Ser Gly Leu Val Gly Glu Asp Gly Ala Thr His Gin Gly 
930 935 940 

ata tat gat tta tct tat ctt ggg aca ctt aac aat gca tat ata ata 3002 
lie Tyr Asp Leu Ser Tyr Leu Gly Thr Leu Asn Asn Ala Tyr lie lie 
945 950 955 

tct cca agt aat caa gtt gat ttg aaa aga get ctt agg ttt get tat 3050 
Ser Pro Ser Asn Gin Val Asp Leu Lys Arg Ala Leu Arg Phe Ala Tyr 
960 965 970 975 

tta gat aag gac cat tct gtg tat ata cgt ata ccc aga atg aac ata 3098 
Leu Asp Lys Asp His Ser Val Tyr lie Arg lie Pro Arg Met Asn lie 
980 985 990 

, tta agt gat aag tac atg aaa gga tat ttg aac att cat atg aaa aat 3146 
Leu Ser Asp Lys Tyr Met Lys Gly Tyr Leu Asn lie His Met Lys Asn 
995 1000 1005 

gag age aaa aat ate gat gta aac gtg gat ata aac gat gat gta gat 3194 
Glu Ser Lys Asn lie Asp Val Asn Val Asp lie Asn Asp Asp Val Asp 
1010 1015 1020 

aaa tat agt gaa gaa tat atg gac gat gat aat ttt ata aaa teg ttt 3242 
Lys Tyr Ser Glu Glu Tyr Met Asp Asp Asp Asn Phe lie Lys Ser Phe 
1025 1030 1035 
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att gga aaa tct aga att att aaa atg gat aat gaa aat aat aat aca 3290 
He Gly Lys Ser Arg He He Lys Met Asp Asn Glu Asn Asn Asn Thr 
1040 1045 1050 1055 

aat gaa cat tat tea age aga gga gat aca cag aca aaa aaa aaa aaa 3338 
Asn Glu His Tyr Ser Ser Arg Gly Asp Thr Gin Thr Lys Lys Lys Lys 
1060 1065 1070 

gtt tgt ate ttt aac atg ggt agt atg ctt ttt aat gta att aat get 3386 
Val Cys He Phe Asn Met Gly Ser Met Leu Phe Asn Val lie Asn Ala 
1075 1080 1085 

aca aaa gaa att gaa aaa gaa caa tat att tea cat aat tat tct ttt 3434 
lie Lys Glu lie Glu Lys Glu Gin Tyr He Ser His Asn Tyr Ser Phe 
1090 1095 1100 

tea att gtt gat atg ata ttt tta aat cct tta gat aaa aat atg ata 3482 
Ser lie Val Asp Met lie Phe Leu Asn Pro Leu Asp Lys Asn Met lie 
1105 1110 1115 

gat cat gta ata aaa caa aat aaa cat caa tat tta att act tat gaa 3530 
Asp His Val He Lys Gin Asn Lys His Gin Tyr Leu lie Thr Tyr Glu 
1120 1125 1130 1135 

gat aat act ata ggt ggt ttt tct aca cat ttc aat aat tat tta ata 3578 
Asp Asn Thr He Gly Gly Phe Ser Thr His Phe Asn Asn Tyr Leu He 
1140 1145 1150 

gaa aat aat tat att aca aaa cat aac tta tat gtt cat aat att tat 3626 
Glu Asn Asn Tyr He Thr Lys His Asn Leu Tyr Val His Asn lie Tyr 
1155 1160 1165 

tta tct aat gag cca att gaa cat gca tct ttt aag gat caa caa gaa 3674 
Leu Ser Asn Glu Pro He Glu His Ala Ser Phe Lys Asp Gin Gin Glu 
1170 1175 1180 

gtc gtc aaa atg gat aaa tgt agt ctt gtc aat aga att aaa aat tat 3722 
Val Val Lys Met Asp Lys Cys Ser Leu Val Asn Arg lie Lys Asn Tyr 
1185 1190 1195 

ctt aaa aat aat cct aca tgatgtaaga taaatatata tttctaaaat 3770 
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Leu Lys Asn Asn Pro Thr 
1200 1205 

tatttttttt ttatacttta atgtgtacaa taaaatatat atctaaatat attttatttg 3830 
tacgcttttt tttttttttt tttaattgtt atttttgtat at 3872 



<210> 4 
<211> 1205 
<212> PRT 

<2I3> Plasmodium falciparum 
<400> 4 

Met lie Phe Asn Tyr Val Phe Phe Lys Asn Phe Val Pro Val Val Leu 
15 10 15 

Tyr lie Leu Leu lie lie Tyr lie Asn Leu Asn Gly Met Asn Asn Lys 
20 25 30 

Asn Gin He Lys Thr Glu Lys He Tyr He Lys Lys Leu Asn Arg Leu 
35 40 45 

Ser Arg Lys Asn Ser Leu Cys Ser Ser Lys Asn Lys He Ala Cys Leu 
50 55 60 

Phe Asp He Gly Asn Asp Asp Asn Arg Asn Thr Thr Tyr Gly Tyr Asn 
65 70 75 80 

Val Asn Val Lys Asn Asp Asp He Asn Ser Leu Leu Lys Asn Asn Tyr 
85 90 95 

Ser Asn Lys Leu Tyr Met Asp Lys Arg Lys Asn lie Asn Asn Val He 
100 105 110 

Ser Thr Asn Lys He Ser Gly Ser lie Ser Asn lie Cys Ser Arg Asn 
115 120 125 

Gin Lys Glu Asn Glu Gin Lys Arg Asn Lys Gin Arg Cys Leu Thr Gin 
130 135 140 

Cys His Thr Tyr Asn Met Ser His Glu Gin Asp Lys Leu Ala Asn Asp 
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145 150 155 160 



Asn Asn Arg Asn Asn Lys Lys Asn Phe Asn Leu Leu Phe lie Asn Tyr 
165 170 175 

Phe Asn Leu Lys Arg Met Lys Asn Ser Leu Leu Asn Lys Asp Asn Phe 
180 185 190 

Phe Tyr Cys Lys Glu Lys Lys Leu Ser Phe Leu His Lys Ala Tyr Lys 
195 200 205 

Lys Lys Asn Cys Thr Phe Gin Asn Tyr Ser Leu Lys Arg Lys Ser Asn 
210 215 220 

Arg Asp Ser His Lys Leu Phe Ser Gly Glu Phe Asp Asp Tyr Thr Asn 
225 230 235 240 

Asn Asn Ala Leu Tyr Glu Ser Glu Lys Lys Glu Tyr lie Thr Leu Asn 
245 250 255 

Asn Asn Asn Lys Asn Asn Asn Asn Lys Asn Asn Asp Asn Lys Asn Asn 
260 265 270 

Asp Asn Asn Asp Tyr Asn Asn Asn Asn Ser Cys Asn Asn Leu Gly Glu 
275 280 285 

Arg Ser Asn His Tyr Asp Asn Tyr Gly Gly Asp Asn Asn Asn Pro Cys 
290 295 300 

Asn Asn Asn Asn Asp Lys Tyr Asp lie Gly Lys Tyr Phe Lys Gin He 
305 < 310 315 320 

Asn Thr Phe He Asn He Asp Glu Tyr Lys Thr He Tyr Gly Asp Glu 
325 330 335 

He Tyr Lys Glu He Tyr Glu Leu Tyr Val Glu Arg Asn He Pro Glu 
340 345 350 

Tyr Tyr Glu Arg Lys Tyr Phe Ser Glu Asp lie Lys Lys Ser Val Leu 
355 360 365 

Phe Asp He Asp Lys Tyr Asn Asp Val Glu Phe Glu Lys Ala He Lys 
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370 375 380 



Glu Glu Phe He Asn Asn Gly Val Tyr He Asn Asn lie Asp Asn Thr 
385 390 395 400 

Tyr Tyr Lys Lys Glu Asn He Leu He Met Lys Lys He Leu His Tyr 
405 410 415 

Phe Pro Leu Leu Lys Leu He Asn Asn Pro Ser Asp Leu Lys Lys Leu 
420 425 430 

Lys Lys Gin Tyr Leu Pro Leu Leu Ala His Glu Leu Lys He Phe Leu 
435 440 445 

Phe Phe He Val Asn He Thr Gly Gly His Phe Ser Ser Val Leu Ser 
450 455 460 

Ser Leu Glu He Gin Leu Leu Leu Leu Tyr lie Phe Asn Gin Pro Tyr 
465 470 475 480 

Asp Asn Val He Tyr Asp He Gly His Gin Ala Tyr Val His Lys He 
485 490 495 

Leu Thr Gly Arg Lys Leu Leu Phe Leu Ser Leu Arg Asn Lys Lys Gly 
500 505 510 

He Ser Gly Phe Leu Asn He Phe Glu Ser He Tyr Asp Lys Phe Gly 
515 520 525 

Ala Gly His Ser Ser Thr Ser Leu Ser Ala He Gin Gly Tyr Tyr Glu 
530 535 540 

.Ma Glu Trp Gin Val Lys Asn Lys Glu Lys Tyr Gly Asn Gly Asp He 
545 550 555 560 

Glu He Ser Asp Asn Ala Asn Val Thr Asn Asn Glu Arg lie Phe Gin 
565 570 575 

Lys Gly lie His Asn Asp Asn Asn lie Asn Asn Asn lie Asn Asn Asn 
580 585 590 

Asn Tyr He Asn Pro Ser Asp Val Val Gly Arg Glu Asn Thr Asn Val 
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595 600 €05 



Pro Asn Val Arg Asn Asp Asn His Asn Val Asp Lys Val His lie Ala 
610 615 620 

lis lie Gly Asp Gly Gly Leu Thr Gly Gly Met Ala Leu Glu Ala Leu 
625 630 635 640 

Asn Tyr lie Ser Phe Leu Asn Ser Lys lie Leu lie lie Tyr Asn Asp 
645 650 655 

Asn Gly Gin Val Ser Leu Pro Thr Asn Ala Val Ser lie Ser Gly Asn 
660 665 670 

Arg Pro lie Gly Ser lie Ser Asp His Leu His Tyr Phe Val Ser Asn 
675 680 685 

He Glu Ala Asn Ala Gly Asp Asn Lys Leu Ser Lys Asn Ala Lys Glu 
690 695 700 

Asn Asn He Phe Glu Asn Leu Asn Tyr Asp Tyr He Gly Val Val Asn 
705 710 715 720 

Gly Asn Asn Thr Glu Glu Leu Phe Lys Val Leu Asn Asn He Lys Glu 
725 730 735 

Asn Lys Leu Lys Arg Ala Thr Val Leu His Val Arg Thr Lys Lys Ser 
740 745 750 

Asn Asp Phe He Asn Ser Lys Ser Pro He Ser He Leu His Ser He 
755 760 765 

Lys Lys Asn Glu He Phe Pro Phe Asp Thr Thr He Leu Asn Gly Asn 
770 775 780 

He His Lys Glu Asn Lys He Glu Glu Glu Lys Asn Val Ser Ser Ser 
785 790 795 800 

Thr Lys Tyr Asp Val Asn Asn Lys Asn Asn Lys Asn Asn Asp Asn Ser 
805 810 815 

Glu He He Lys Tyr Glu Asp Met Phe Ser Lys Glu Thr Phe Thr Asp 
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820 825 830 . 

lie Tyr Thr Asn Glu Met Leu Lys Tyr Leu Lys Lys Asp Arg Asn lie 
835 840 845 

lie Phe Leu Ser Pro Ala Met Leu Gly Gly Ser Gly Leu Val Lys He 
850 855 860 

Ser Glu Arg Tyr Pro Asn Asn Val Tyr Asp Val Gly He Ala Glu Gin 
865 870 875 880 

His Ser Val Thr Phe Ala Ala Ala Met Ala Met Asn Lys Lys Leu Lys 
885 890 895 

He Gin Leu Cys He Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp Gin 
900 905 910 

He He His Asp Leu Asn Leu Gin Asn He Pro Leu Lys Val lie lie 
915 920 925 

Gly Arg Ser Gly Leu Val Gly Glu Asp Gly Ala Thr His Gin Gly He 
930 935 940 

Tyr Asp Leu Ser Tyr Leu Gly Thr Leu Asn Asn Ala Tyr He lie Ser 
945 950 955 960 

Pro Ser Asn Gin Val Asp Leu Lys Arg Ala Leu Arg Phe Ala Tyr Leu 
965 970 975 

Asp Lys Asp His Ser Val Tyr He Arg lie Pro Arg Met Asn He Leu 
980 985 990 

Ser Asp Lys Tyr Met Lys Gly Tyr Leu Asn lie His Met Lys Asn Glu 
995 1000 1005 

Ser Lys Asn He Asp Val Asn Val Asp lie Asn Asp Asp Val Asp Lys 
1010 1015 1020 

Tyr Ser Glu Glu Tyr Met Asp Asp Asp Asn Phe lie Lys Ser Phe lie 
025 1030 1035 1040 

Gly Lys Ser Arg lie lie Lys Met Asp Asn Glu Asn Asn Asn Thr Asn 
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1045 . 1050 1055 

Glu His Tyr Ser Ser Arg Gly Asp Thr Gin Thr Lys Lys Lys Lys Val 
1060 1065 1070 

Cys lie Phe Asn Met Gly Ser Met Leu Phe Asn Val lie Asn Ala lie 
1075 1080 1085 

Lys Glu lie Glu Lys Glu Gin Tyr lie Ser His Asn Tyr Ser Phe Ser 
1090 1095 1100 

lie Val Asp Met He Phe Leu Asn Pro Leu Asp Lys Asn Met He Asp 
105 1110 1115 1120 

His Val He Lys Gin Asn Lys His Gin Tyr Leu He Thr Tyr Glu Asp 
1125 1130 1135 

Asn Thr He Gly Gly Phe Ser Thr His Phe Asn Asn Tyr Leu He Glu 
1140 1145 1150 

Asn Asn Tyr He Thr Lys His Asn Leu Tyr Val His Asn He Tyr Leu 
1155 1160 1165 

Ser Asn Glu Pro He Glu His Ala Ser Phe Lys Asp Gin Gin Glu Val 
1170 1175 1180 

Val Lys Met Asp Lys Cys Ser Leu Val Asn Arg He Lys Asn Tyr Leu 
185 1190 1195 1200 

Lys Asn Asn Pro Thr 
1205 



<210> 5 
<211> 3147 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
<221> CDS 

<222> (199) . . (2670) 



BNSOOCID: <WO 0017233A2_L> 



- WO 00/17233 PCT/EP99/07055 

<400> 5 

tttcattttt ctttacccac atatatatat atatatatat aatatatata tataatatta 60 

tatatttgat atatgattta aaattgtaac ataaaaaaaa taattatatt aaatatgtgt 120 

atacatctcc aacatataaa tattattttt tattattatt tttttttttt tttttcataa 180 

tgcctgaata accacaaa atg agt tat ata aaa aga ctg att ctt ttt atg 231 

Met Ser Tyr He Lys Arg Leu He Leu Phe Met 
15 10 

tea ctg ttt tat tct cat gta aaa att aaa aaa tta ttt att aaa att 279 

Leu Leu Phe Tyr Ser His Val Lys He Lys Lys Leu Phe He Lys He 
15 20 25 

tct aat gta aac ata ttt ttt gca gaa gca aag aaa aat gga aaa aag 327 

Ser Asn Val Asn lie Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys 

30 35 40 

gaa ttc ttt ctt ttt tta eta aat ata aaa aaa aat age caa cag aaa 375 

Glu Phe Phe Leu Phe Leu Leu Asn He Lys Lys Asn Ser Gin Gin Lys 

45 50 55 

aaa act tat cat att acc aaa agg aat acc ata aat aaa agt gat ttt 423 

Lys Thr Tyr His He Thr Lys Arg Asn Thr He Asn Lys Ser Asp Phe 
60 65 70 75 

tea tat tct tta eta aat gaa gaa ggg aat tct tea aaa aag gaa tat 471 

Leu Tyr Ser Leu Leu Asn Glu Glu Gly Asn Ser Ser Lys Lys Glu Tyr 
80 85 90 

aaa aat tta aaa gat gaa gaa aaa tat aat ate ata caa aat ata aaa 519 

Lys Asn Leu Lys Asp Glu Glu Lys Tyr Asn He lie Gin Asn lie Lys 
95 100 105 

aaa tat tgt gaa tgt act aaa aaa tat aaa agg etc cca aca cga gaa - 567 

Lys Tyr Cys Glu Cys Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu 

110 115 120 

gta gtt att gga aat gtt aaa att gga gga aat aat aaa ata get att 615 

Val Val lie Gly Asn Val Lys lie Gly Gly Asn Asn Lys He Ala lie 
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125 130 135 



caa act atg get age tgt gat aca aga aat gta gaa gaa tgt gta tat 663 
Gin Thr Met Ala Ser Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr 
140 145 150 155 

caa att aga aaa tgt aaa gat ttg ggt get gac att gta agg ttg act 711 

Gin lie Arg Lys Cys Lys Asp Leu Gly Ala Asp lie Val Arg Leu Thr 
160 165 170 

gtt caa gga gtt caa gaa gca caa get agt tat cat att aaa gaa aaa 759 

Val Gin Gly Val Gin Glu Ala Gin Ala Ser Tyr His lie Lys Glu Lys 

175 180 185 

tta tta tct gaa aat gta aat ate cca tta gta gca gat att cat ttt 807 

Leu Leu Ser Glu Asn Val Asn lie Pro Leu Val Ala Asp lie His Phe 
190 195 200 

aat cct aaa ata get tta atg gca get gat gtg ttt gaa aaa .att cga 855 

Asn Pro Lys lie Ala Leu Met Ala Ala Asp Val Phe Glu Lys lie Arg 
205 210 215 

gtg aat cca gga aat tat gtt gac gga aga aaa aaa tgg ata gat aaa 903 

Val Asn Pro Gly Asn Tyr Val Asp Gly Arg Lys Lys Trp lie Asp Lys 
220 225 230 235 

gtt tat aaa act aaa gaa gaa ttt gat gaa ggg aaa tta ttt ata aaa 951 

Val Tyr Lys Thr Lys Glu Glu Phe Asp Glu Gly Lys Leu Phe lie Lys 
240 245 250 

gaa aaa ttt gta cca tta att gaa aaa tgt aaa aga tta aat aga gca 999 

Glu Lys Phe Val Pro Leu lie Glu Lys Cys Lys Arg Leu Asn Arg Ala 

255 260 265 

ata aga att gga aca aat cat gga tec ctt tea tct cga gta tta tea 1047 

lie Arg lie Gly Thr Asn His Gly Ser Leu Ser Ser Arg Val Leu Ser 
270 275 280 

tat tat gga gat aca cca tta ggt atg gta gaa teg get ttt gag ttt 1095 

Tyr Tyr Gly Asp Thr Pro Leu Gly Met Val Glu Ser Ala Phe Glu Phe 
285 290 295 
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tcz gat tta tgt att gaa aac aat ttt tac aat ctt gtt ttt tct atg 1143 
Ser Asp Leu Cys lie Glu Asn Asn Phe Tyr Asn Leu Val Phe Ser Met 
300 305 310 315 

aaa get tct aat get tat gtt atg ata caa tct tat aga tta tta gta 1191 
Lys Ala Ser Asn Ala Tyr Val Met lie Gin Ser Tyr Arg Leu Leu Val 
320 325 330 

tct aaa caa tat gaa aga aat atg atg ttc cct ata cat tta gga gtt 1239 
Ser Lys Gin Tyr Glu Arg Asn Met Met Phe Pro lie His Leu Gly Val 
335 340 345 

aca gaa gca gga ttt ggt gat aat gga aga ata aaa tct tat tta ggt 1287 
Thr Glu Ala Gly Phe Gly Asp Asn Gly Arg He Lys Ser Tyr Leu Gly 
350 355 360 

ata gga tct tta tta tat gat ggt ata gga gat acc att cgt ata tec 1335 
He Gly Ser Leu Leu Tyr Asp Gly He Gly Asp Thr He Arg He Ser 
365 370 375 

tta aca gaa gat cct tgg gaa gag tta act cct tgt aaa aaa tta gtt 1383 
Leu Thr Glu Asp Pro Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val 
380 385 390 395 

gaa aat tta aag aaa aga ata ttt tat aat gaa aat ttt aaa gaa gat 1431 
Glu Asn Leu Lys Lys Arg He Phe Tyr Asn Glu Asn Phe Lys Glu Asp 
400 405 410 

aat gaa tta aaa aat aat gaa atg gat acc aaa aat eta tta aat ttt 1479 
Asn Glu Leu Lys Asn Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe 
415 420 425 

gaa gaa aat tat cga aat ttt aat aat ata aaa aaa aga aat gta gaa 1527 
Glu Glu Asn Tyr Arg Asn Phe Asn Asn He Lys Lys Arg Asn Val Glu 
430 435 440 

aaa aat aat aat gta tta cat gaa gag tgc act ata ggt aat gta gta * 1575 
Lys Asn Asn Asn Val Leu His Glu Glu Cys Thr He Gly Asn Val Val 
445 450 455 

acc ata aaa gag tta gaa gat tct ctg caa att ttt aaa gat tta aat 1623 
Thr He Lys Glu Leu Glu Asp Ser Leu Gin He Phe Lys Asp Leu Asn 
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460 465 470 475 



tta gaa gta gat tea aat gga aat ttg aaa aag gga gee aaa aca act 1671 
Leu Glu Val Asp Ser Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr 
480 485 490 



gat atg gtt att ata aat gat ttt cat aat ata aca aat tta gga aaa 1719 
Asp Met Val lie lie Asn Asp Phe His Asn lie Thr Asn Leu Gly Lys 
495 500 505 



aaa act gtg gat aaa tta atg caa gtg gga att aat ata gta gtt caa 1767 
Lys Thr Vai Asp Lys Leu Met Gin Val Gly He Asn He Val Val Gin 
510 515 . 520 



tat gaa cca cat aat ata .gaa ttt ata gaa aaa atg gaa cca aat aat 1815 
Tyr Glu Pro His Asn He Glu Phe He Glu Lys Met Glu Pro Asn Asn 
525 530 535 

gat aat aat aat aat aat aat aat aat aat ata tta ttt tat gtg gat 1863 
Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn He Leu Phe Tyr Val Asp 
540 545 550 555 

ata aaa aat att atg aac agt tea gaa aaa aat att aaa tta agt aat 1911 
He Lys Asn He Met Asn Ser Ser Glu Lys Asn He Lys Leu Ser Asn 
560 565 570 

tct aaa gga tat gga tta att tta aac gga aaa gaa gat ata caa acc 1959 
Ser Lys Gly Tyr Gly Leu He Leu Asn Gly Lys Glu Asp He Gin Thr 
575 580 585 



ata aaa aaa ata aaa gaa tta aat cgt cgt cct tta ttc att eta tta 2007 
He Lys Lys He Lys Glu Leu Asn Arg Arg Pro Leu Phe He Leu Leu 
590 595 600 



aaa tea gat aac ata tat gaa cat gta tta ata acc aga aga att aat 2055 

Lys Ser Asp Asn He Tyr Glu His Val Leu He Thr Arg Arg He Asn 

605 610 615 

gaa ctt tta caa tec tta aat ata aat ata cct tat ata cat tat gtt 2103 

Glu Leu Leu Gin Ser Leu Asn He Asn He Pro Tyr lie His Tyr Vai 

620 625 630 635 
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gat att aat tea aac aat. tat gat gat 
Asp He Asn Ser Asn Asn Tyr Asp Asp 
640 

tat gca gga agt tgt ttg atg gat tta 
Tyr Ala Gly Ser Cys Leu Met Asp Leu 
655 660 
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ata tta gtt aat tea aca tta 2151 
He Leu Val Asn Ser Thr Leu 
645 650 

atg ggg gat ggt ctt att gtt 2199 
Met Gly Asp Gly Leu He Val 
665 



aac gta act aat gat gtt ctt aca aat aaa aaa aag ata gaa aca aaa 2247 
Asn Val Thr Asn Asp Val Leu Thr Asn. Lys Lys Lys He Glu Thr Lys 
670 675 680 

tat gat gaa aaa gaa gaa gta gag gaa gag gga aac aat aaa gat att 2295 
Tyr Asp Glu Lys Glu Glu Val Glu Glu Glu Gly Asn Asn Lys Asp He 
685 690 695 



cat aga ctt ttg age aga gtt gca tta aat tea ttt tta aca tta aat 2543 
His Arg Leu Leu Ser Arg Val Ala Leu Asn Ser Phe Leu Thr Leu Asn 
700 705 710 715 



att tta caa gat aca aga ata cgt tta ttt aaa aca gat tat ata gec 2391 
He Leu Gin Asp Thr Arg He Arg Leu Phe Lys Thr Asp Tyr He Ala 
720 725 730 



tgc cca tct tgt gga aga act tta ttt aat ata caa gaa act act aaa 2439 

Cys Pro Ser Cys Gly Arg Thr Leu Phe Asn lie Gin Glu Thr Thr Lys 
735 740 745 

aaa att atg aaa tta aca ggg cac tta aaa ggc gtt aaa att gca gtc 2487 

Lys He Met Lys Leu Thr Gly His Leu Lys Gly Val Lys He Ala Val 
750 755 760 



atg gga tgt att gtt aat ggt ata gga gaa atg gca gat gca cat ttt 2535 
Met Gly Cys He Val Asn Gly He Gly Glu Met Ala Asp Ala His Phe 
765 770 775 



ggt tat gtt ggt agt gca cct aaa aaa att gat tta tat tat ggt aaa- 2583 
Gly Tyr Val Gly Ser Ala Pro Lys Lys He Asp Leu Tyr Tyr Gly Lys 
780 785 790 795 



gag tta gta gaa aga aat ata cct gag gaa gaa get tgt gat aaa ttg 2631 
Glu Leu Val Glu Arg Asn He Pro Glu Glu Glu Ala Cys Asp Lys Leu 
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ata gaa tta att aaa aaa cat aac aaa tgg aaa gat cca taaattgaat 2680 
lie Glu Leu lie Lys Lys His Asn Lys Trp Lys Asp Pro 
815 820 

atggacaagt atttatttat ttatttatct tatatataat atattataaa tttttcgatg 2740 

tatttttcct tttaaaattt tatttttttt ttattttttt ttttgaagta atatttataa 2800 

tgca.tacata atattaaaat gtgtattata taataatatc attttattgt tattttaaaa 2860 

gactaatacc aagaacaatt ttttaataat cattcttata acttgttaaa tatatatata 2920 

tatatatata tatttattta tttatattta tatttattta tttttggtat atgaaaagta 2980 

aaaatataat aatttaaaag tatttacaaa ataaataata ttatatatct gtttttatat 3040 

atatgttaat ggaaaaggag aaaataaata aataaaacaa acaaaataac atatatatat 3100 

atatatatat actgaatgag aaagaaaaaa aaaagaaaag gatacga 3147 

<210> 6 
<211> 824 
<212> PRT 

<213> Plasmodium falciparum 
<400> 6 

Met Ser Tyr lie Lys Arg Leu lie Leu Phe Met Leu Leu Phe Tyr Ser 
15 10 15 

Kis Val Lys He Lys Lys Leu Phe He Lys He Ser Asn Val Asn He 
20 25 30 

Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys Glu Phe Phe Leu Phe 
35 40 45 

Leu Leu Asn lie Lys Lys Asn Ser Gin Gin Lys Lys Thr Tyr His He 
50 55 60 

Thr Lys Arg Asn Thr He Asn Lys Ser Asp Phe Leu Tyr Ser Leu Leu 
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65 70 75 80 

Asn Glu Glu Gly Asn Ser Ser Lys Lys Glu Tyr Lys Asn Leu Lys Asp 
85 90 95 

Glu Glu Lys Tyr Asn He He Gin Asn He Lys Lys Tyr Cys Glu Cys 
100 105 110 

Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu Val Val He Gly Asn 
115 120 125 

Val Lys He Gly Gly Asn Asn Lys He Ala He Gin Thr Met Ala Ser 
130 135 140 

Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr Gin He Arg Lys Cys 
145 150 155 160 

Lys Asp Leu Gly Ala Asp He Val Arg Leu Thr Val Gin Gly Val Gin 
165 170 175 

Glu Ala Gin Ala Ser Tyr His He Lys Glu Lys Leu Leu Ser Glu Asn 
180 185 190 

Val Asn He Pro Leu Val Ala Asp He His Phe Asn Pro Lys He Ala 
195 200 205 

Leu Met Ala Ala Asp Val Phe Glu Lys He Arg Val Asn Pro Gly Asn 
210 215 220 

Tyr Val Asp Gly Arg Lys Lys Trp He Asp Lys Val Tyr Lys Thr Lys 
225 230 235 240 

Glu Glu Phe Asp Glu Gly Lys Leu Phe He Lys Glu Lys Phe Val Pro 
245 250 255 

Leu He Glu Lys Cys Lys Arg Leu Asn Arg Ala He Arg He Gly Thr 
260 265 270 

Asn His Gly Ser Leu Ser Ser Arg Val Leu Ser Tyr Tyr Gly Asp Thr 
275 280 285 

Pro Leu Gly Met Val Glu Ser Ala Phe Glu Phe Ser Asp Leu Cys He 



BNSOOCID: <WO 0017233A*_L> 



WO 00/1 7233 PCT/EP99/07055 

28 

290 295 300 



Glu Asn Asn Phe Tyr Asn Leu Val Phe Ser Met Lys Ala Ser Asn Ala 
305 310 315 320 

Tyr Val Met lie Gin Ser Tyr Arg Leu Leu Val Ser Lys Gin Tyr Glu 
325 330 335 

Arg Asn Met Met Phe Pro He His Leu Gly Val Thr Glu Ala Gly Phe 
340 345 350 

Gly Asp Asn Gly Arg He Lys Ser Tyr Leu Gly He Gly Ser Leu Leu 
355 360 365 

Tyr Asp Gly He Gly Asp Thr lie Arg He Ser Leu Thr Glu Asp Pro 
370 375 380 

Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val Glu Asn Leu Lys Lys 
385 390 395 400 

Arg He Phe Tyr Asn Glu Asn Phe Lys Glu Asp Asn Glu Leu Lys Asn 
405 410 415 

Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe Glu Glu Asn Tyr Arg 
420 425 430 

Asn Phe Asn Asn lie Lys Lys Arg Asn Val Glu Lys Asn Asn Asn Val 
435 440 445 

Leu His Glu Glu Cys Thr lie Gly Asn Val Val Thr He Lys Glu Leu 
450 455 460 

Glu Asp Ser Leu Gin lie Phe Lys Asp Leu Asn Leu Glu Val Asp Ser 
465 470 475 480 

Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr Asp Met Val lie lie 
485 490 495 

Asn Asp Phe His Asn lie Thr Asn Leu Gly Lys Lys Thr Val Asp Lys 
500 505 510 

Leu Met Gin Val Gly lie Asn He Val Val Gin Tyr Glu Pro His Asn 
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515 520 525 

II Glu Phe He Glu Lys Met Glu Pro Asn Asn Asp Asn Asn Asn Asn 
530 535 540 

Asn Asn Asn Asn Asn He Leu Phe Tyr Val Asp He Lys Asn He Met 
545 550 555 560 

Asn Ser Ser Glu Lys Asn He Lys Leu Ser Asn Ser Lys Gly Tyr Gly 
565 570 575 

Leu He Leu Asn Gly Lys Glu Asp lie Gin Thr lie Lys Lys He Lys 
580 585 590 

Glu Leu Asn Arg Arg Pro Leu Phe He Leu Leu Lys Ser Asp Asn He 
595 600 605 

Tyr Glu His Val Leu He Thr Arg Arg He Asn Glu Leu Leu Gin Ser 
610 615 620 

Leu Asn He Asn He Pro Tyr He His Tyr Val Asp He Asn Ser Asn 
625 630 635 640 

Asn Tyr Asp Asp He Leu Val Asn Ser Thr Leu Tyr Ala Gly Ser Cys 
645 650 655 

Leu Met Asp Leu Met Gly Asp Gly Leu lie Val Asn Val Thr Asn Asp 
660 665 670 

Val Leu Thr Asn Lys Lys Lys lie Glu Thr Lys Tyr Asp Glu Lys Glu 
675 680 685 

Glu Val Glu Glu Glu Gly Asn Asn Lys Asp He His Arg Leu Leu Ser 
690 695 700 

Arg Val Ala Leu Asn Ser Phe Leu Thr Leu Asn He Leu Gin Asp Thr 
705 710 715 720 

Arg lie Arg Leu Phe Lys Thr Asp Tyr lie Ala Cys Pro Ser Cys Gly 
725 730 735 

Arg Thr Leu Phe Asn lie Gin Glu Thr Thr Lys Lys He Met Lys Leu 
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(57) Abstract 

The invention relates to the 1-desoxy- D-xylulose- 5-phosphate reductoisomerase gene, the 1-desoxy- D-xylulose- 5-phosphate- 
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(57) Zusammenfassung 

Die vorliegende Erhndung betrifft das 1-Desoxy- D-xylulose- 5-phosphatreduktoisomerase -Gen, das 1-Desoxy— D-xylulose- 
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Claims 

1. DNA sequences which code for a polypeptide with the 
amino acid sequence shown in SEQ ID no, 2 or for an 

5 analogue or derivative of the polypeptide according 

to SEQ ID no. 2, in which one or more amino acids 
have been deleted, added or replaced by other amino 
acids, wherein the enzymatic action of the 
polypeptide is retained, and which sequences 
10 originate from parasites, wherein sequence 

variations occurring within the framework of natural 
strain variability are included, 

2. DNA sequences which code for a polypeptide with the 
15 amino acid sequence shown in SEQ ID no. 4 or for an 

analogue or derivative of the polypeptide according 
to SEQ ID no. 4, in which one or more amino acids 
have been deleted, added or replaced by other amino 
acids, wherein the enzymatic action of the 
20 polypeptide is retained, and which sequences 

originate from parasites, wherein sequence 
variations occurring within the framework of natural 
strain variability are included. 

25 3. DNA sequences which code for a polypeptide with the 
amino acid sequence shown in SEQ ID no. 6 or for an 
analogue or derivative of the polypeptide according 
to SEQ ID no. 6, in which one or more amino acids 
have been deleted, added or replaced by other amino 

30 acids wherein the catalytic function of the 

polypeptide is retained. 
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4. DNA sequence according to one of claims 1 to 3, 
characterised in that it also comprises functional 
regulation signals, in particular promoters, 
operators, enhancers, ribosomal binding sites. 

5 

5. DNA sequence with the following sub-sequences 

i) promoter which is active in viruses, eukaryotes 
and prokaryotes and ensures the formation of an 
RNA in the intended target tissue or target 

10 cells, 

ii) DNA sequences according to one of claims 1 
to 3, 

iii) 3* untranslated sequence which, in viruses, 
eukaryotes and prokaryotes, results in the 

15 addition of poly (A) residues onto the 3' end of 

the RNA. 



6. Process for the production of transgenic viruses, 
eukaryotes and prokaryotes for modifying the 

20 isoprenoid content, characterised in that a DNA 

sequence according to claim 4 or 5 is transferred 
and incorporated into the genome of viruses, 
eukaryotic and prokaryotic cells with or without use 
of a vector. 

25 

7. Transgenic systems, in particular plants and plant 
cells which contain one or more DNA sequences 
according to claims 1 to 5 as "foreign" or 
"additional" DNA, which sequences are expressed. 

30 

8. Expression vector containing one or more DNA 
sequences according to claims 1 to 5. 
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9. Protein which is involved in the 1-deoxy-D-xylulose 
5-phosphate metabolic pathway and a) is coded by DNA 
sequences SEQ ID no. 1, 3 or 5 or b) is coded by DNA 
sequences which hybridise with DNA sequences SEQ ID 
no. 1, 3, 5 or fragments of these DNA sequences in 
the DNA region which codes for the mature protein. 

10. Protein according to claim 9, obtainable from the 
culture supernatants of parasites or from the 
disrupted parasites and purification by 
chromatographic and electrophoretic methods. 

11. Protein according to one of claims 9 and 10, 
characterised in that it a) is the product of viral, 
prokaryotic or eukaryotic expression of exogenous 
DNA, b) is coded by sequences SEQ ID no. 1, 3 or 5 
or is coded by DNA sequences which hybridise with 
DNA sequences SEQ ID no. 1, 3, 5 or fragments of 
these DNA sequences in the DNA region which codes 
for the mature protein, or c) is coded by DNA 
sequences which would hybridise without degeneration 
of the genetic code with the sequences defined in b) 
and which code for a polypeptide with a 
corresponding amino acid sequence. 

12. Protein according to one of the preceding claims, 
characterised in that it comprises the amino acid 
sequences SEQ ID no. 2, 4 or 6. 

13. Process for determining the enzymatic activity of 
the gcpE protein, characterised in that 
phosphorylation of a sugar or of a phosphorus sugar 
or of a precursor of isoprenoid biosynthesis, in 



WO 00/17233 



CA 02343919 2001-03-22 

Amendments 



PCT/EP99/07055 



particular the phosphorylation of 2-C-methyl-D- 
erythritol, 2-C-methyl-D-erytritol phosphate, in 
particular 2-C-methyl-D-erythritol 4-phosphate, 2-C- 
methyl-D-erythrose, 2-C-methyl-D-erythrose 
phosphate, in particular 2-C-methyl-D-erythrose 
4-phosphate, and of phosphate and alcohol 
precursors, is detected. 

14. Process according to claim 13, characterised in that 
phosphorylation of the following phosphates or 
alcohols is detected: 
CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -0-P0 (OH) 2, 
CH 2 ( OH ) -C ( CH 3 ) =C ( OH ) -CH 2 -OH , 
CH 2 (OH) -CH (CH 3 ) -CO-CH 2 -O-P0 (OH) 2, 
CH 2 (OH) -CH (CH 3 ) -CO-CH 2 OH 
CH 2 =C (CH 3 ) -CO-CH2-O-PO (OH) 2 , 
CH 2 =C (CH 3 ) -CO-CH2-OH, 
CH 2 =C (CH 3 ) -CH (OH) -CH2-O-PO (OH) 2 ,' 
CH 2 =C (CH 3 ) -CH (OH) -CH 2 -OH, 
CH 2 (OH) -C (=CH 2 ) -C (OH) -CH 2 -0-PO (OH) 2 , 
CH 2 (OH) -C (=CH 2 ) -C (OH) -CH 2 -0H 
CHO-CH (CH 3 ) -CH (OH) -CH 2 -0-PO- (OH) 2 , 
CHO-CH (CH 3 ) -CH (OH) -CH 2 -OH, 
CH 2 (OH)-C(OH) (CH 3 )-CH=CH-0-PO(OH) 2 , 
CH 2 (OH) -C (OH) (CH 3 ) -CH=CH-OH 
CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -0-PO(OH) 2 , 
CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -OH, 
(CH 3 ) 2 HC-CO-CH 2 -0-PO (OH) 2, 
(CH 3 ) 2 HC-CO-CH 2 -0-H, 
(CH 3 ) 2 HC-CH (OH) -CH 2 -0-PO (OH) 2 , 
(CH 3 ) 2 HC-CH (OH) -CH 2 -0-H. 
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15. Process for the combined determination of the 

enzymatic activity of DOXP synthase and of DOXP 
reductase, characterised in that the conversion of 
glyceraldehyde 3-phosphate into 2-C-methylerythritol 
5 4-phosphate is detected. 



16. Process for screening a compound for the treatment 
of infectious processes in humans and animals, 
wherein the process comprises: 
10 a) provision of a host cell which contains a 

recombinant expression vector, wherein the 
vector comprises at least a portion of the 
oligonucleotide sequence according to SEQ ID 
no. 1, SEQ ID no. 3 or SEQ ID no. 5 or variants 
15 or analogues thereof, and moreover of a 

compound suspected to have antimycotic, 
antibiotic, antiparasitic or antiviral action 
in humans and animals, 

b) bringing the host cell into contact with the 
20 compound and 

c) determining the antimicrobial, antimycotic, 
antibiotic, antiparasitic or antiviral action 
of the compound. 

25 17. Process for screening for compounds for treating 
plants, wherein the process comprises: 
a) provision of a host cell which contains a 
recombinant expression vector, wherein the 
vector comprises at least a portion of the 
30 oligonucleotide sequence according to SEQ ID 

no. 1, SEQ ID no. 3 or SEQ ID no. 5 or variants 
or analogues thereof, and moreover of a 
compound suspected to have antimicrobial, 
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antiviral, antiparasitic, bactericidal, 
fungicidal or herbicidal action in plants, 
b) bringing the host cell into contact with the 
compound and 

5 c) determining the antimicrobial, antiviral, 

antiparasitic, bactericidal, fungicidal or 
herbicidal action of the compound. 

18. Use of DNA according to one of . claims 1 to 5 or of 
10 proteins according to one of claims 9 to 12 or of 

transgenic systems according to claim 7 for the 
prevention or treatment of diseases in humans and 
animals. 



CA 02343919 2001-03-22 

WO 00/17233 PCT/EP99/07055 

-1- 

Genes of the 1 -deoxy-D-xylulose biosynthesis pathway 

The present invention relates to DNA sequences which, 
when incorporated into the genome of viruses, eukaryotes 
5 and prokaryotes, modify isoprenoid biosynthesis and to a 
genetic engineering process for the production of . these 
transgenic viruses, eukaryotes and prokaryotes- The 
invention also relates to a process for the 
identification of substances having herbicidal, 
10 antimicrobial, antiparasitic, antiviral, fungicidal, 
bactericidal action in plants and antimicrobial, 
antiparasitic, antimycotic, antibacterial and antiviral 
action in humans and animals. 

15 The biosynthesis pathway for the formation of isoprenoids 
via the classical acetate/mevalonate pathway and an 
alternative mevalonate-independent biosynthesis pathway, 
the deoxy-D-xylulose phosphate pathway is already known 
(Rohmer, M. , Knani, M., Simonin, P., Sutter, B. and Sahm, 

20 H. (1993): Biochem. J. 295: 517-524). 

It is, however, not known how and by which pathways it is 
possible to bring about a change in the isoprenoid 
concentration in viruses, eukaryotes and prokaryotes by 
25 means of the deoxy-D-xylulose phosphate pathway. Figure 1 
shows this biosynthesis pathway. 

DNA sequences are consequently provided which code for 
1-deoxy-D-xylulase 5-phosphate synthase (DOXP synthase), 
30 1-deoxy-D-xylulose 5-phosphate reductoisomerase (DOXP 
reductoisomerase) or the gcpE protein. All three genes 
and enzymes are involved in isoprenoid biosynthesis. 
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(Translator's comment: The portion at the beginning of the next paragraph enclosed in 
square brackets corresponds to the beginning of the sentence which finishes on page 
2, line 1 of the original). 

[The gcpE protein has a kinase function and catalyses the 
5 phosphorylation of a sugar or a phosphorus sugar or a 

precursor of isoprenoid biosynthesis, in particular the 
phosphorylation of 2-C-methyl-D-erythritol, 2-C-methyl-D- 
erytritol phosphate, in particular 2-C-methyl-D- 
erythritol 4 -phosphate, 2-C-methyl-D-erythrose, 2-C- 

10 methyl-D-erythrose] phosphate, in particular 2-C-methyl- 
D-erythrose 4 -phosphate. In the precursor of isoprenoid 
synthesis, the gcpE protein in particular catalyses the 
phosphorylation of the following substances: 
CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -0-PO (OH) 2 , 

15 CH 2 (OH) -C (CH 3 ) =C (OH) -CH 2 -OH, 

CH 2 (OH) -CH <CH 3 ) -CO-CH2-O-PO (OH) 2/ 
CH 2 (OH) -CH (CH 3 ) -CC-CH2OH 
CH 2 =C (CH 3 ) -CO-CH2-O-PO (OH) 2 , 
CH 2 =C (CH 3 ) -CO-CH 2 -OH, 

20 CH 2 =C (CH 3 ) -CH (OH) -CH 2 -0-PO (OH) 2 , 
CH 2 =C (CH 3 ) -CH (OH) -CH 2 -OH, 
CH 2 (OH) -C (=CH 2 ) -C (OH) -CH 2 -0-PO (OH) 2 , 
CH 2 (OH) -C (=CH 2 ) -C (OH) -CH 2 -OH 
CHO-CH (CH 3 ) -CH (OH) -CH 2 -O-P0- (OH) 2f 

25 CHO-CH (CH 3 ) -CH (OH) -CH 2 -OH, 

CH 2 (OH) -C (OH) (CH 3 ) -CH-CH-O-PO (OH) 2 , 
CH 2 (OH)-C(OH) (CH 3 )-CH=CH-OH 
CH (OH) =C (CH 3 ) -CH (OH) -CH 2 -0-PO (OH) 2 , 
CH (OH) »C (CH 3 ) -CH (OH) -CH 2 -OH, 

30 (CH 3 ) 2 HC-CO-CH 2 -0-PO (OH) 2 , 
(CH 3 ) 2 HC-CO-CH 2 -0-H, 
(CH 3 ) 2 HC-CH (OH) -CH 2 -0-PO (OH) 2 , 
(CH 3 ) 2 HC-CH (OH) -CH2-O-H. 
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DOXP synthase catalyses the condensation of pyruvate and 
glyceraldehyde 3-phosphate to yield 1-deoxy-D-xylulose 
5-phosphate and DOXP reductoisomerase catalyses the 
5 conversion of 1-deoxy-D-xylulose 5-phosphate into 
2-C-methyl-D-erythritol 4-phosphate {c.f. Fig, 1). 

The invention relates to the following DNA sequences: 
DNA sequences which code for a polypeptide with the amino 

10 acid sequence shown in SEQ ID no. 2 or for an analogue or 
derivative of the polypeptide according to SEQ ID no. 2, 
in which one or more amino acids have been deleted, added 
or replaced by other amino acids, wherein the enzymatic 
action of the polypeptide is retained, and which 

15 sequences originate from parasites, wherein sequence 
variations occurring within the framework of natural 
strain variability are included, 

DNA sequences which code for a polypeptide with the amino 
20 acid sequence shown in SEQ ID no. 4 or for an analogue or 
derivative of the polypeptide according to SEQ ID no. 4, 
in which one or more amino acids have been deleted, added 
or replaced by other amino acids, wherein the enzymatic 
action of the polypeptide is retained, and which 
25 sequences originate from parasites, wherein sequence 
variations occurring within the framework of natural 
strain variability are included, 

and DNA sequences which code for a polypeptide with the 
30 amino acid sequence shown in SEQ ID no. 6 or for an 

analogue or derivative of the polypeptide according to 
SEQ ID no. 6, in which one or more amino acids have been 
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deleted, added or replaced by other amino acids, wherein 

the catalytic function of the polypeptide is retained. 



The genes and the gene products thereof (polypeptides) 
are shown with their primary structure and are assigned 

as follows: 

SEQ ID no. 1: 1-deoxy-D-xylulose 5-phosphate reducto- 

isomerase gene 
SEQ ID no. 2: 1-deoxy-D-xylulose 5-phosphate reducto- 

isomerase 

SEQ ID no. 3: 1-deoxy-D-xylulose 5-phosphate synthase 
gene 

SEQ ID no. 4: 1-deoxy-D-xylulose 5-phosphate synthase 
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SEQ ID no. 5: gcpE gene 

SEQ ID no. 6: gcpE proteins. 



The DNA sequences all originate from Plasmodium 
5 falciparum. 

Apart from the DNA sequences stated in the sequence 
listing, suitable sequences are also those which, as a 
result of the degeneration of the genetic code, have 
10 another DNA sequence, but code for the same peptide or 
for an analogue or derivative of the polypeptide, in 
which one or more amino acids have been deleted, added or 
replaced by other amino acids. 

15 The sequences according to the invention are suitable for 
the expression of genes in viruses, eukaryotes and 
prokaryotes which are responsible for isoprenoid 
biosynthesis in the 1-deoxy-D-xylulose pathway. 

20 According to the invention, eukaryotes or eukaryotic 

cells include animal cells, plant cells, algae, yeasts, 
. -.fungi r whil-e pro-karyotes or prokaryatic cells include 
bacteria, archaebacteria and eubacteria. 

. 25 When a DNA sequence is incorporated into a genome on 
which the above-stated DNA sequence is located, 
expression of the above-described genes in viruses, 
eukaryotes and prokaryotes is enabled. The viruses, 
eukaryotes and prokaryotes transformed according to the 
30 invention are cultivated in a manner known per se and the 
isoprenoid formed during such cultivation is isolated and 
optionally purified. Not all isoprenoids need to be 
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isolated as in some case the isoprenoids are releas d 
directly into the ambient air. 

The invention furthermore relates to a process for the 
production of transgenic viruses, eukaryotes and 
prokaryotes in order to modify the isoprenoid content, 
which process comprises the following steps. 

a) Production of a DNA sequence with the following sub- 
sequences 

i) promoter which is active in viruses, eukaryotes 
and prokaryotes and ensures the formation of an 
RNA in the intended target tissue or target 
cells, 

ii) DNA sequence which codes for a polypeptide with 
the amino acid sequence shown in. SEQ ID no. 2, 

4 or 6 or for an analogue or derivative of the 
polypeptide according to SEQ ID no. 2, 4 or 6, 

iii) 5' and 3 1 untranslated sequence which enables 
or enhances expression of the stated genes in 
viruses, eukaryotes and prokaryotes, 

Jo), transfer and incorporation of the DNA sequence into 
the genome of viruses, prokaryotic or eukaryotic 
cells with or without the use of a vector (for 
example plasmid, viral DNA) . 

The intact, whole plants may be regenerated from plant 
cells transformed in this manner. 

The protein-coding sequences with the nucleotide 
sequences SEQ ID no. 1, SEQ ID no. 3 and SEQ ID no. 5 may 
be provided with a promoter which ensures transcription 
in certain organs or cells, which promoter is coupled in 
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sense orientation (3' end of the promoter to the 5' end 
of the coding sequence) to the sequence which codes the 
protein to be formed. A termination signal which 
determines termination of mRNA synthesis is attached to 
5 the 3 1 end of the coding sequence. In order to direct the 
protein which is to be expressed to certain subcellular 
compartments, such as chloroplasts, amyloplasts, 
mitochondria, vacuoles, cytosol or intercellular spaces, 
a further sequence which codes for a so-called signal 

10 sequence or a transit peptide may be inserted between the 
promoter and the coding sequence. In some cases, it is 
necessary to insert sequences which code for a signal at 
the COOH terminus of the protein. The sequence must be in 
the same reading frame as the coding sequence of the 

15 protein. A large number of cloning vectors is available 
in order to prepare for the introduction of the DNA 
sequences according to the invention into higher plants, 
which vectors contain a replication signal for E. coli 
and a marker which permits selection of the transformed 

20 cells. Depending upon the method by which desired genes 
are introduced into the plant, further DNA sequences may 
be. required. If, -for example, the Ti or Ri plasmid is 
used to transform the plant cells, at least one right 
border, but frequently the right border and left border 

25 of the Ti and Ri plasmid T-DNA must be inserted as a 

flanking region into the genes to be introduced. The use 
of T-DNA for transforming plant cells has been 
intensively investigated and comprehensively described in 
EP 120516; Hoekama in "The Binary Plant Vector System", 

30 Of f set-drukkeri j Kanters B.V. Alblasserdam (1985), 

chapter V; Fraley et al., Crit .Rev. Plant Sex. 4, 1-46 and 
An et al. (1985) EMBO J. 4, 277-287. Once the introduced 
DNA has been incorporated into the genome, it is 
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generally stable and is also retained in the descendants 
of the originally transformed cells. It normally contains 
a selection marker, which imparts to the transformed 
plant cells resistance to a biocide or an antibiotic, 
5 such as kanamycin, G 418, bleomycin, hygromycin or 

phosphinotricin and others. The particular marker used is 
thus intended to allow selection of transformed cells 
from cells lacking the inserted DNA. 

10 Many techniques are available for introducing DNA into a . 
plant. These techniques include transformation with the 
assistance of agrobacteria, for example Agrobacterium 
tumefaciens, protoplast fusion, microinjection of DNA, 
electroporation, as well as ballistic methods and virus 

15 infection. Whole plants may then be regenerated from the 
transformed plant material in a suitable medium which may 
contain antibiotics or biocides for selection purposes. 
No particular requirements are placed upon the plasmids 
for injection and electroporation. However, if whole 

20 plants are to be regenerated from such transformed cells, 
a selectable marker gene must be present. The transformed 
. . . .cells ..grow., in. the plants in the conventional manner 

(McCormick et al. (1986), Plant Cell Reports 5, 81-84).. 
The plants may be cultivated normally and be crossed with 

25 plants which have the same transformed genome or other 

genomes. The resultant individuals have the corresponding 
phenotypic properties. 

The present invention also provides expression vectors 
30 which contain one or more of the DNA sequences according 
to the invention. Such expression vectors are obtained by 
providing the DNA sequences according to the invention 
with suitable functional regulation signals. Such 



CA 02343919 2001-03-22 



WO 00/17233 PCT/EP99/07055 

-8- 

regulation signals are DNA sequences which are 
responsible for expression, for example promoters, 
operators, enhancers, ribosomal binding sites, and are 
recognised by the host organism. 

5 

Further regulation signals, which for example control 
replication or recombination of the recombinant DNA in 
the host organism, may optionally also be a constituent 
part of the expression vector. 

10 

The host organisms transformed with the DNA sequences or 
expression vectors according to the invention are also 
provided by the present invention. 

15 Suitable host cells and organisms for expressing the 
enzymes according to the invention are those which 
comprise no intrinsic enzymes with the function of DOXP 
synthase, DOXP reductoisomerase or the gcpE protein. This 
is the case for archaebacteria, animals, fungi, slime 

20 moulds and some eubacteria. The absence of such intrinsic 
enzyme activity substantially facilitates detection and 
■ purification of : the recombinant enzymes. As a 

consequence, it is also for the first time possible 
straightforwardly to measure, in crude extracts from the 

25 host cells, the activity and in particular the inhibition 
of the activity of the recombinant enzymes according to 
the invention by various chemicals and pharmaceuticals. 

The enzymes according to the invention are advantageously 
30 then expressed in eukaryotic cells if post-translational 
modification and native folding of the polypeptide chain 
is to be achieved. Moreover, depending upon the 
expression system, it is ensured when expressing genomic 
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DNA sequences that introns are eliminated by splicing the 
DNA and. the enzymes are produced in the polypeptide 
sequences characteristic to the parasites. Using 
recombinant DNA techniques, sequences coding for introns 
5 may be eliminated from or inserted for experimental 
purposes into the DNA sequences to be expressed. 

The protein may be isolated from the host cell or the 
culture supernatant of the host cell using methods known 
10 to the person skilled in the art. In vitro reactivation 
of the enzymes may also be required. 

In order to facilitate purification, the enzymes 
according to the invention or sub-sequences of the 
15 enzymes may be expressed as fusion proteins with various 
peptide chains. Oligo-histidine sequences and sequences 
derived from glutathione S-transf erase, thioredoxin or 
calmodulin-binding peptides are particularly suitable for 
this purpose. 

20 

The enzymes according to the invention or sub-sequences 
...of. ...the enzymes may furthermore be expressed , as fusion 
proteins with such peptide chains known to the person 
skilled in the art that the recombinant enzymes are 
25 transported into the extracellular medium or into certain 
compartments of the host cells. Both purification and 
investigation of the biological activity of the enzymes 
may consequently be facilitated. 

30 When expressing the enzymes according to the invention, 
it may prove convenient to modify individual codons. 
Purposeful replacement of bases in the coding region may 
here also be advisable if the codons used in the 
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parasites differ from the codon use in the heterologous 
expression system, in order to ensure optimal synthesis 
of the protein. 

5 The enzymes according to the invention may furthermore be 
obtained under standardised conditions by in vitro 
translation by methods known to the person skilled in the 
art. Systems suitable for this purpose are rabbit 
reticulocyte and wheat germ extracts and bacterial 
10 lysates. In vitro transcribed mRNA may also be translated 
into Xenopus oocytes. 

Oligo- and polypeptides, the sequences of which are 
derived from the peptide sequence of the enzymes 

15 according to the invention, may be obtained by chemical 
synthesis. Given appropriate selection of the sequences, 
such peptides have properties which are characteristic of 
the enzymes according to the invention. Such peptides may 
be produced in large quantities and are particularly 

20 suitable for investigating the kinetics of enzyme 

activity, regulation of enzyme activity, the three- 
dimensional, structure of the enzymes,, inhibition .of 
enzyme activity by various chemicals and pharmaceuticals 
and the binding geometry and binding affinity of various 

25 ligands. 

DNA with the nucleotides from sequences SEQ ID no. 1, 3 
and 5 are preferably used for the recombinant production 
of the enzymes according to the invention. 

30 

The invention accordingly moreover relates to a process 
for screening for compounds which inhibit the deoxy-D- 
xylulose phosphate metabolic pathway. According to this 
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process, a host organism, which contains a recombinant 
expression vector, wherein the vector comprises at least 
a portion of the oligonucleotide sequence according to 
SEQ IE) no. 1, SEQ ID no. 3 or SEQ ID no. 5 or variants or 
5 homologues thereof, is provided, as is a compound which 
is suspected to have antimicrobial, antiparasitic, 
antibacterial, antiviral and antimycotic action in humans 
and animals or an antimicrobial, antiviral, bactericidal, 
herbicidal or fungicidal activity in plants. The host 
10 organism is then brought into contact with the compound 
and the activity of the compound determined. 

The present invention also provides methods for 
determining the enzymatic activity of the gcpE protein. 

15 Said activity may be determined using known methods. 
Determination is performed by detecting the 
phosphorylation of a sugar or of a phosphorus sugar or of 
a precursor of isoprenoid biosynthesis, in particular the 
phosphorylation of 2-C-methyl-D-erythritol, 2-C-methyl-D- 

20 erytritol phosphate, in particular 2-C-methyl-D- 
erythritol 4 -phosphate, 2-C-methyl-D-erythrose, 2-C- 
.methyl-D^erythrose^ phosphate., in particular 2-C-methyl-D- 
erythrose 4-phosphate. The present invention also 
provides the use of this measurement method for 

25 identifying substances which inhibit the activity of the 
particular enzymes* 

The enzymatic activity of DOXP synthase and D0XP 
reductoisomerase may be detected in a single step by 
30 determining the conversion of glyceraldehyde 3-phosphate 
into 2-C-methylerythritol 4-phosphate. 
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Determination of the activities of DOXP synthase and DOXP 
reductoisomerase proceeds analogously. Fluorimetric 
methods described by Querol et al. are also suitable for 
determining DOXP synthase activity (Querol et al., 
abstracts, 4 th European Symposium on Plant Isoprenoids, 
Barcelona, 21-23 April 1999} . 
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SEQUENCE LISTING 



<110> Jomaa, Hassan 

' <120>* 'Genes' of ^the 1-deoxy-D^xylulose biosynthesi-s pathway 

<130> 15696 

<140> PCT/EP99 
<141> 1999-09-22 

<150> DEr9923567.8 
<151> 1999-05-22 

<150> DE19843279.8 
<151> 1998-09-22 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1467 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
<221> CDS 
<222> (1) . . (1467) 

<220> 

<221> gene 
<222> (1) . . (1467) 

<220> 

<221>.mRNA 
<222> (1) . . {1467) ' 

<400> 1 

atg aag aaa tat att tat ata tat 
Met Lys Lys Tyr lie Tyr He Tyr 
1 5 

aat gat tta gta ata aat aat aca 
Asn Asp Leu Val He Asn Asn Thr 
20 

aga aaa aat aac gca tat ata aat 
Arg Lys Asn Asn Ala Tyr He Asn 
35 40 



ttt ttc ttc ate aca ata act att 48 

Phe Phe Phe He Thr He Thr He 

10 15 

tea aaa tgt gtt tec att gaa aga 96 

Ser Lys Cys Val Ser He Glu Arg 

25 30 

tat ggt ata gga tat aat gga cca 144 

Tyr Gly He Gly Tyr Asn Gly Pro 
45 



gat aat aaa ata aca aag agt aga aga tgt aaa aga ata aag tta tgc 192 
Asp Asn Lys He Thr Lys Ser Arg Arg Cys Lys Arg He Lys Leu Cys 
50 55 60 
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aaa aag gat tta ata gat att ggt gca ata aag aaa cca att aat gta 240 

Lys Lys Asp Leu lie Asp lie Gly Ala lie Lys Lys Pro lie Asn Val 

65 70 75 80 

gca att ttt gga agt act ggt agt ata ggt acg aat get tta aat ata 288 

Ala lie i Phe- 'Gly Ser -Thr Gly Sex lie Gly Thr Asn Ala Leu Asn lie 

85 90 95 

ata agg gag tgt aat aaa att gaa aat gtt ttt aat gtt aaa gca ttg 336 

lie Arg Glu Cys Asn Lys lie Glu Asn Val Phe Asn Val Lys Ala Leu 

100 105 110 

tat gtg aat aag agt gtg aat gaa tta tat gaa caa get aga gaa ttt 384 

Tyr Val Asn Lys Ser Val Asn Glu Leu Tyr Glu Gin Ala Arg Glu Phe 

115 120 125 

tta cca gaa tat ttg tgt ata cat gat aaa agt gta tat gaa gaa tta 432 

Leu Pro Glu Tyr Leu Cys He His Asp Lys Ser Val Tyr Glu Glu Leu 

130 135 140 

aaa gaa ctg gta aaa aat ata aaa gat tat aaa cct ata ata ttg tgt 480 

Lys Glu Leu Val Lys Asn He Lys Asp Tyr Lys Pro lie He Leu Cys 

145 150 155 160 

ggt gat gaa ggg atg aaa gaa ata tgt agt agt aat agt ata gat aaa 528 

Gly Asp Glu Gly Met Lys Glu He Cys Ser Ser Asn Ser He Asp Lys 

165 170 175 

ata gtt att ggt att gat tct ttt caa gga tta tat tct act atg tat 576 

He Val He Gly lie Asp Ser Phe Gin Gly Leu Tyr Ser Thr Met Tyr 

180 185 190 

gca att atg aat aat aaa ata gtt gcg tta get aat aaa gaa tec att 624 

Ala He Met Asn Asn Lys He Val Ala Leu Ala Asn Lys Glu Ser He 

195 200 205 

gtc tct get ggt ttc ttt tta aag aaa tta tta aat att cat aaa aat 672 

Val Ser Ala Gly Phe Phe Leu Lys Lys Leu Leu Asn He His Lys Asn 

210 215 220 

gca aag ata ata cct gtt gat tea gaa cat agt get ata ttt caa tgt 720 

Ala Lys He He Pro Val Asp Ser Glu His Ser Ala He Phe Gin Cys 

225 230 235 240 

tta gat aat aat aag gta tta aaa aca aaa tgt tta caa gac aat ttt 768 

Leu Asp Asn Asn Lys Val Leu Lys Thr Lys Cys Leu Gin Asp Asn Phe 

245 250 255 

tct aaa att aac aat ata aat aaa ata ttt tta tgt tea tct gga ggt 816 

Ser Lys He Asn Asn He Asn Lys He Phe Leu Cys Ser Ser Gly Gly 

260 265 270 

cca ttt caa aat tta act atg gac gaa tta aaa aat gta aca tea gaa 864 

Pro Phe Gin Asn Leu Thr Met Asp Glu Leu Lys Asn Val Thr Ser Glu 

275 280 285 
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aat get tta aag cat cct aaa tgg aaa atg ggt aag aaa ata act ata 912 
Asn Ala Leu Lys His Pro Lys Trp Lys Met Gly Lys Lys lie Thr lie 
290 295 300 

gat tct gca act atg atg aat aaa ggt tta gag gtt ata gaa acc cat 960 
Asp *Ser Ala Thr Met Met: Asn Lys l Gly Leu Glu Val lie Glu Thr His 
305 310 315 320 

ttt tta ttt gat gta gat tat aat gat ata gaa gtt ata gta cat aaa 1008 
Phe Leu Phe Asp Val Asp Tyr Asn Asp lie Glu Val lie Val His Lys 
325 330 335 

gaa tgc att ata cat tct tgt gtt gaa ttt ata gac aaa tea gta ata 1056 
Glu Cys 21e lie His Ser Cys Val Glu Phe He Asp Lys Ser Val He 
340 345 350 

agt caa atg tat tat cca gat atg caa ata ccc ata tta tat tct tta 1104 
Ser Gin Met Tyr Tyr Pro Asp Met Gin lie Pro He Leu Tyr Ser Leu 
355 360 365 

aca tgg cct gat aga ata aaa aca aat tta aaa cct tta gat ttg get 1152 
Thr Trp Pro Asp Arg He Lys Thr Asn Leu Lys Pro Leu Asp Leu Ala 
370 375 380 

cag gtt tea act ctt aca ttt cat aaa cct tct tta gaa cat ttc ccg 1200 
Gin Val Ser Thr Leu Thr Phe His Lys Pro Ser Leu Glu His Phe Pro 
385 390 395 400 

tgt att aaa tta get tat caa gca ggt ata aaa gga aac ttt tat cca 1248 
Cys He Lys Leu Ala Tyr Gin Ala Gly He Lys Gly Asn Phe Tyr Pro 
405 410 415 

act gta eta aat gcg tea aat gaa ata get aac aac tta ttt ttg aat 1296 
Thr Val Leu Asn Ala Ser Asn Glu He Ala Asn Asn Leu Phe Leu Asn 
420 425 430 

aat aaa att aaa tat ttt gat att tec tct ata ata teg caa gtt ctt 1344 
Asn Lys He Lys Tyr Phe Asp He Ser Ser He He Ser Gin Val Leu 
435 440 445 

gaa tct ttc aat tct caa aag gtt teg gaa aat agt gaa gat tta atg 1392 
Glu Ser Phe Asn Ser Gin Lys Val Ser Glu Asn Ser Glu Asp Leu Met 
450 455 460 

aag caa att eta caa ata cat tct tgg gec aaa gat aaa get acc gat 144 0 
Lys Gin He Leu Gin lie His Ser Trp Ala Lys Asp Lys Ala Thr Asp 
465 470 475 480 

ata tac aac aaa cat aat tct tea tag 1467 
He Tyr Asn Lys His Asn Ser Ser 
485 



<210> 2 
<211> 488 
<212> PRT 

<213> Plasmodium falciparum 
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<400> 2 

Met Lys Lys Tyr lie Tyr He Tyr Phe Phe Phe He Thr He Thr He 
15 10 -15 

Asn Asp Leu Val He Asn Asn Thr Ser Lys Cys Val Ser He Glu Arg 
'20 25 30 

Arg Lys Asn Asn Ala Tyr He Asn Tyr Gly He Gly Tyr Asn Gly Pro 
35 40 45 

Asp Asn Lys He Thr Lys Ser Arg Arg Cys Lys Arg He Lys Leu Cys 
50 55 60 

Lys Lys Asp Leu He Asp He Gly Ala He Lys Lys Pro He Asn Val 
65 70 75 80 

Ala He Phe Gly Ser Thr Gly Ser He Gly Thr Asn Ala Leu Asn He 
85 90 95 

He Arg Glu Cys Asn Lys He Glu Asn Val Phe Asn Val Lys Ala Leu 
100 105 110 

Tyr Val Asn Lys Ser Val Asn Glu Leu Tyr Glu Gin Ala Arg Glu Phe 
115 120 125 

Leu Pro Glu Tyr Leu Cys He His Asp Lys Ser Val Tyr Glu Glu Leu 
130 135 140 

Lys Glu Leu Val Lys Asn He Lys Asp Tyr Lys Pro He He Leu Cys 
145 150 155 160 

Gly Asj> Glu Gly . Met Lys Glu lie Cys Ser .Ser Asn- Ser He Asp Lys 
165 170 175 

He Val He Gly. He Asp Ser Phe Gin Gly Leu Tyr Ser Thr Met Tyr 
180 185 190 

Ala He Met Asn Asn Lys He Val Ala Leu Ala Asn Lys Glu Ser He 
195 200 205 

Val Ser Ala Gly Phe Phe Leu Lys Lys Leu Leu Asn He His Lys Asn 
210 215 220 

Ala Lys He He Pro Val Asp Ser Glu His Ser Ala He Phe Gin Cys 
225 230 235 240 

Leu Asp Asn Asn Lys Val Leu Lys Thr Lys Cys Leu Gin Asp Asn Phe 
245 250 255 

Ser Lys He Asn Asn He Asn Lys He Phe Leu Cys Ser Ser Gly Gly 
260 265 270 



Pro Phe Gin Asn Leu Thr Met Asp Glu Leu Lys Asn Val Thr Ser Glu 
275 280 285 



Asn Ala Leu Lys His Pro Lys Trp Lys Met Gly Lys Lys He Thr He 
290 295 300 



CA 02343919 2001-03-22 



WO 00/17233 PCT/EP99/07055 

5- 



Asp S r Ala Thr Met Met Asn Lys Gly Leu Glu Val He Glu Thr His 
305 310 315 320 

Phe Leu Phe Asp Val Asp Tyr Asn Asp He Glu Val He Val His Lys 
325 330 335 

Glu Cys He lie His Ser Cys Val Glu Phe He Asp Lys Ser Val He 
340 345 350 

Ser Gin Met Tyr Tyr Pro Asp Met Gin He Pro He Leu Tyr Ser Leu 
355 360 365 

Thr Trp Pro Asp Arg He Lys Thr Asn Leu Lys Pro Leu Asp Leu Ala 
370 375 380 

Gin Val Ser Thr Leu Thr Phe His Lys Pro Ser Leu Glu His Phe Pro 
385 390 395 400 

Cys He Lys Leu Ala Tyr Gin Ala Gly He Lys Gly Asn Phe Tyr Pro 
405 410 415 

Thr Val Leu Asn Ala Ser Asn Glu He Ala Asn Asn Leu Phe Leu Asn 
420 425 430 

Asn Lys He Lys Tyr Phe Asp He Ser Ser lie lie Ser Gin Val Leu 
435 440 445 

Glu Ser Phe Asn Ser Gin Lys Val Ser Glu Asn Ser Glu Asp Leu Met 
450 455 460 

Lys Gin He Leu Gin lie His Ser Trp Ala Lys Asp Lys Ala Thr Asp 
465 470 475 480 

He Tyr Asn Lys His Asn Ser Ser . 
• 485 



<210> 3 
<211> 3872 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
<221> CDS 

<222> (126) . . (3740) 
<220> 

<221> gene 

<222> (1) . . (3870) 

<220> 

<221> mRNA 

<222> (1) . . (3870) 



<400> 3 

ggtaatatac gtataatata tatataatat attcttacgt atgtatcatt tatgaatcat 60 



CA 02343919 2001-03-22 

WO 00/17233 PCT7EP99/07055 

-6- 

aataatattc taaatttacc ttccgttttt gctcgatctt ctcattttcg tttcagcttt 120 

tatca atg att ttt aat tat gtg ttt ttt aag aac ttt gta cca gtt gtt 170 
Met He Phe Asn Tyr Val Phe Phe Lys Asn Phe Val Pro Val Val 

1 5 10 15 

eta tac att etc ctt ata ata tat att aac tta aat ggc atg aat aat 218 

L u Tyr He Leu Leu He He Tyr He Asn Leu Asn Gly Met Asn Asn 

20 25 30 

aaa aat caa ata aaa aca gaa aaa att tat ata aag aaa ttg aat agg 266 

Lys Asn Gin He Lys Thr Glu Lys He Tyr He Lys Lys Leu Asn Arg 

35 40 45 

ttg tea agg aaa aat teg tta tgt agt tct aaa aat aaa ata gca tgc 314 

Leu Ser Arg Lys Asn Ser Leu Cys Ser Ser Lys Asn Lys He Ala Cys 

50 55 60 

ttg ttc gat ata gga aat gat gat aat aga aat acg aca tat ggc tat 362 

Leu Phe Asp He Gly Asn Asp Asp Asn Arg Asn Thr Thr Tyr Gly Tyr 
65 70 75 

aat gtg aat gtt aaa aat gat gat att aat tec tta eta aaa aat aat 410 

Asn Val Asn Val Lys Asn Asp Asp He Asn Ser Leu Leu Lys Asn Asn 

80 85 90 95 

tat agt aat aaa ttg tac atg gat aag agg aaa aat att aat aat gta 458 

Tyr Ser Asn Lys Leu Tyr Met Asp Lys Arg Lys Asn He Asn Asn Val 

100 105 110 

att agt act aat aaa ata tct ggg tec att tea aat att tgt agt aga 506 

II Ser Thr Asn Lys He Ser Gly Ser He Ser Asn He Cys Ser .Arg 

115 120 125 

aat caa aaa gaa aat gaa caa aaa aga aat aaa caa aga tgt tta act 554 

Asn Gin Lys Glu Asn Glu Gin Lys Arg Asn Lys Gin Arg Cys Leu Thr 

130 135 140 

caa tgt >cac-act tat aat atg -tea - -cat gaa cag gac aaa eta get aat 602 

Gin Cys His Thr Tyr Asn Met Ser His Glu Gin Asp Lys Leu Ala Asn 
145 150 155 

gat aat aat agg aat aat aaa aag aat ttt aat tta tta ttt ata aat 650 

Asp Asn Asn Arg Asn Asn Lys Lys Asn Phe Asn Leu Leu Phe He Asn 

160 165 170 175 

tat ttt aat ttg aaa cga atg aaa aat tct ctt eta aat aaa gac aat 698 

Tyr Phe Asn Leu Lys Arg Met Lys Asn Ser Leu Leu Asn Lys Asp Asn 

180 185 190 

ttc ttt tac tgt aaa gaa aaa aaa ttg tea ttt ctg cat aag gee tat 746 

Phe Phe Tyr Cys Lys Glu Lys Lys Leu Ser Phe Leu His Lys Ala Tyr 

195 200 205 



aaa aaa aaa aat tgc act ttt caa aat tat agt tta aaa aga aaa tct 
Lys Lys Lys Asn Cys Thr Phe Gin Asn Tyr Ser Leu Lys Arg Lys Ser 
210 215 220 



794 
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aat cgt gat tea cat aaa ttg ttt tct gga gaa ttt gac gat tat aca 842 
Asn Arg Asp Ser His Lys Leu Phe Ser Gly Glu Phe Asp Asp Tyr Thr 
225 230 235 

aat aat aat get tta tat gaa tec gaa aaa aaa gaa tac att aca eta 890 
Asn Asn'Asn Ala "Leu Tyr Glu* 'Ser Glu Lys Lys Glu "Tyr lie Thr Leu 
240 245 250 255 

aat aat aat aat aaa aat aat aat aat aaa aat aat gat aat aaa aat 938 
Asn Asn Asn Asn Lys Asn Asn Asn Asn Lys Asn Asn Asp Asn Lys Asn 
260 265 270 

aat gat aat aat gat tat aat aat aat aat agt tgt aat aat tta gga 986 
Asn Asp Asn Asn Asp Tyr Asn Asn Asn Asn Ser Cys Asn Asn Leu Gly 
275 280 285 

gag aga tec aat cat tat gat aat tat ggt gga gat aat aat aat cca 1034 
Glu Arg Ser Asn His Tyr Asp Asn Tyr Gly Gly Asp Asn Asn Asn Pro 
290 295 300 

tgt aat aat aat aat gac aaa tat gat ata gga aaa tat ttc aaa cag 1082 
Cys Asn Asn Asn Asn Asp Lys Tyr Asp lie Gly Lys Tyr Phe Lys Gin 
305 310 315 

att aat ace ttt att aat att gat gaa tat aaa act ata tat ggt gat 1130 
lie Asn Thr Phe He Asn lie Asp Glu Tyr Lys Thr He Tyr Gly Asp 
320 325 330 335 

gaa ata tat aaa gaa ata tat gaa eta tat gta gaa aga aat att cct 1178 
Glu He Tyr Lys Glu He Tyr Glu Leu Tyr Val Glu Arg Asn He Pro 
340 345 350 

gaa tat tat gaa cga aaa tat ttt tea gaa gat att aaa aag agt gtc 1226 
Glu Tyr Tyr Glu Arg Lys Tyr Phe Ser Glu Asp He Lys Lys Ser Val 
355 360 365 

eta ttt gat ata gat aaa tat aat gat gtc gaa ttt gaa aaa get ata 1274 
Leu Phe Asp He Asp Lys Tyr Asn Asp Val Glu Phe Glu Lys Ala He 
370 375 380 

aaa gaa gaa ttt ata aat aat gga gtt tat att aat aat ata gat aat 1322 
Lys Glu Glu Phe He Asn Asn Gly Val Tyr He Asn Asn He Asp Asn 
385 390 395 

aca tat tat aaa aaa gaa aat att tta ata atg aaa aag ata tta cat 1370 
Thr Tyr Tyr Lys Lys Glu Asn lie Leu He Met Lys Lys He Leu His 
400 405 410 415 

tat ttc cca tta tta aaa tta att aat aat cca tea gat tta aaa aag 1418 
Tyr Phe Pro Leu Leu Lys Leu He Asn Asn Pro Ser Asp Leu Lys Lys 
420 425 430 

tta aaa aaa caa tat tta cct tta tta gca cat gaa tta aaa ata ttt 14 66 
Leu Lys Lys Gin Tyr Leu Pro Leu Leu Ala His Glu Leu Lys He Phe 
435 440 445 
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tta ttt ttt att gta aat ata aca gga ggt cat ttt tec tct gtt tta 1514 
Leu Phe Ph He Val Asn lie Thr Gly Gly His Phe Ser Ser Val Leu 
450 455 460 

age tct tta gaa att caa tta tta tta ttg tat att ttt aat caa cca 1562 
- Sex Ser Leu Glu He Gin Leu Leu Leu Leu Tyr He - Phe Aan- Gln Pro 
465 470 475 

tat gat aat gtt ata tat gat ata gga cat caa gca tat gta cat aag 1610 
Tyr Asp Asn Val He Tyr Asp He Gly His Gin Ala Tyr Val His Lys 
480 485 490 495 

ata ttg acc gga aga aaa eta tta ttt eta tea tta aga aat aaa aaa 1658 
He Leu Thr Gly Arg Lys Leu Leu Phe Leu Ser Leu Arg Asn Lys Lys 
500 505 510 

ggt att agt gga ttc eta aat att ttt gaa agt att tat gat aaa ttt 1706 
Gly He Ser Gly Phe Leu Asn He Phe Glu Ser He Tyr Asp Lys Phe 
515 520 525 

ggg get ggt cac agt tec act tea tta agt get ata caa gga tat tat 1754 
Gly Ala Gly His Ser Ser Thr Ser Leu Ser Ala He Gin Gly Tyr Tyr 
530 535 540 

gaa gee gag tgg caa gtg aag aat aaa gaa aaa tat gga aat gga gat 1802 
Glu Ala Glu Trp Gin Val Lys Asn Lys Glu Lys Tyr Gly Asn Gly Asp 
545 550 555 

ata gaa ata agt gat aac gca aat gtc acg aat aat gaa agg ata ttt 1850 
He Glu He Ser Asp Asn Ala Asn Val Thr Asn Asn Glu Arg He Phe 
560 565 570 575 

caa aaa gga ata cac aat gat aat aat att aac aat aat att aat aat 1898 
Gin Lys Gly He His Asn Asp Asn Asn He Asn Asn Asn lie Asn Asn 
580 585 590 

aat aat tat ate aat cct tea gat gtg gta gga aga gaa aat acg aat 194 6 
Asn Asn Tyr He Asn Pro Ser Asp Val Val Gly Arg Glu Asn Thr Asn 
595 600 605 

gta cca aat gta cga aat gat aac cat aac gtg gat aaa gta cac att 1994 
Val Pro Asn Val Arg Asn Asp Asn His Asn Val Asp Lys Val His He 
610 615 620 

get att ata gga gat ggt ggt tta aca ggt gga atg gca tta gaa gcg 2042 
Ala He He Gly Asp Gly Gly Leu Thr Gly Gly Met Ala Leu Glu Ala 
625 630 635 

tta aat tat att tea ttc ttg aat tct aaa att tta att att tat aat 2090 
Leu Asn Tyr He Ser Phe Leu Asn Ser Lys He Leu He He Tyr Asn 
640 645 650 655 

gat aac gga caa gtt tct tta cca aca aat gee gta agt ata tea ggt 2138 
Asp Asn Gly Gin Val Ser Leu Pro Thr Asn Ala Val Ser He Ser Gly 
660 665 670 



CA 02343919 2001-03-22 



WO 00/17233 PCT/EP99/07055 

-9- 



aat aga cct ata ggt tct ata tea gat cat tta cat tat ttt gtt tct 2186 
Asn Arg Pro lie Gly Ser lie Ser Asp His Leu His Tyr Phe Val Ser 
675 680 685 

aat ata gaa gca aat get ggt gat aat aaa tta teg aaa aat gca aaa 2234 
Asn lie Glu Ala Asn Ala Gly Asp Asn Lys Leu Ser Lys Asn Ala "Lys 
690 695 700 

gag aat aac att ttt gaa aat ttg aat tat gat tat att ggt gtt gtg 2282 
Glu Asn Asn lie Phe Glu Asn Leu Asn Tyr Asp Tyr lie Gly Val Val 
705 710 715 

aat ggt aat aat aca gaa gag etc ttt aaa gta tta aat aat ata aaa 2330 
Asn Gly Asn Asn Thr Glu Glu Leu Phe Lys Val Leu Asn Asn lie Lys 
720 725 730 735 

gaa aat aaa tta aaa aga get act gtt ctt cat gta cgt aca aaa aaa 2378 
Glu Asn Lys Leu Lys Arg Ala Thr Val Leu His Val Arg Thr Lys Lys 
740 745 750 

teg aat gat ttt ata aat tea aag agt cca ata agt ata ttg cac tct 2426 
Ser Asn Asp Phe lie Asn Ser Lys Ser Pro lie Ser lie Leu His Ser 
755 760 765 

ata aag aaa aat gag att ttc cct ttc gat acc act ata tta aat gga 2474 
lie Lys Lys Asn Glu He Phe Pro Phe Asp Thr Thr He Leu Asn Gly 
770 775 780 

aat att cat aag gag aac aag ata gaa gaa gag aaa aat gtg tct tea 2522 
Asn He His Lys Glu Asn Lys He Glu Glu Glu Lys Asn Val Ser Ser 
785 790 795 

tct aca aag tat gat gta aat aat aag aat aat aaa aat aat gat aat 2570 
Ser Thr Lys Tyr Asp Val Asn Asn Lys Asn Asn Lys Asn Asn Asp Asn 
800 805 810 815 

agt gaa att ata aaa tat gaa gat atg ttt tea aaa gag acg ttc aca 2618 
Ser Glu He He Lys Tyr Glu Asp Met Phe Ser Lys Glu Thr Phe Thr 
«20 825 830 

gat ata tat aca aat gaa atg tta aaa tat tta aag aaa gat aga aat 2666 
Asp He Tyr Thr Asn Glu Met Leu Lys Tyr Leu Lys Lys Asp Arg Asn 
835 840 845 

ata ata ttc eta tct ccc get atg tta gga gga tea gga ttg gtt aaa 2714 
He He Phe Leu Ser Pro Ala Met Leu Gly Gly Ser Gly Leu Val Lys 
850 855 860 

att agt gag cgt tat cca aat aat gta tat gat gta ggt ata gca gaa 2762 
He Ser Glu Arg Tyr Pro Asn Asn Val Tyr Asp Val Gly He Ala Glu 
865 870 875 

caa cat tct gta act ttc gca gca get atg gca atg aat aag aaa tta 2810 
Gin His Ser Val Thr Phe Ala Ala Ala Met Ala Met Asn Lys Lys Leu 
880 885 890 895 
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aaa ata caa tta tgt ata tat teg acc ttt tta caa aga gca tat gat 
Lys He Gin Leu Cys He Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp 
900 905 910 



2858 



caa att ata cat gat ctt aat tta caa aat ata cct tta aag gtt ata 
Gin He - lie His- Step -lreu Asn Lbu Gin Asn lie Pro : Leu - fcys Val lie 
915 920 925 



2906 



att gga aga agt gga tta gta gga gag gat ggg gca aca cat caa ggt 2954 
He Gly Arg Ser Gly Leu Val Gly Glu Asp Gly Ala Thr His Gin Gly 
930 935 940 

ata tat gat tta tct tat ctt ggg aca ctt aac aat gca tat ata ata 3002 
He Tyr Asp Leu Ser Tyr Leu Gly Thr Leu Asn Asn Ala Tyr He He 
945 950 955 

tct cca agt aat caa gtt gat ttg aaa aga get ctt agg ttt get tat 3050 
Ser Pro Ser Asn Gin Val Asp Leu Lys Arg Ala Leu Arg Phe Ala Tyr 
960 965 970 975 

tta gat aag gac cat tct gtg tat ata cgt ata ccc aga atg aac ata 3098 
Leu Asp Lys Asp His Ser Val Tyr He Arg He Pro Arg Met Asn He 
980 985 990 



tta agt gat aag tac atg aaa gga tat ttg aac att cat atg aaa aat 
L u Ser Asp Lys Tyr Met Lys Gly Tyr Leu Asn He His Met Lys Asn 
995 1000 1005 



3146 



gag age aaa aat ate gat gta aac gtg gat ata aac gat gat gta gat 
Glu Ser Lys Asn He Asp Val Asn Val Asp He Asn Asp Asp Val Asp 
1010 1015 1020 



3194 



aaa tat agt gaa gaa tat atg gac gat gat aat ttt ata aaa teg ttt 
Lys Tyr Ser Glu Glu Tyr Met Asp Asp Asp Asn Phe He Lys Ser Phe 
1025 1030 1035 



3242 



att gga aaa tct aga att att aaa atg gat aat gaa aat aat aat aca 
He Gly Lys Ser Arg He He Lys Met Asp Asn Glu Asn Asn Asn Thr 
1-040 1*045 1050 10-55 



3290 



aat gaa cat tat tea age aga gga gat aca cag aca aaa aaa aaa aaa 
Asn Glu His Tyr Ser Ser Arg Gly Asp Thr Gin Thr Lys Lys Lys Lys 
1060 1065 1070 



3338 



gtt tgt ate ttt aac atg ggt agt atg ctt ttt aat gta att aat get 
Val Cys He Phe Asn Met Gly Ser Met Leu Phe Asn Val He Asn Ala 
1075 1080 1085 



33bo 



ata aaa gaa att gaa aaa gaa caa tat att tea cat aat tat tct ttt 
He Lys Glu He Glu Lys Glu Gin Tyr He Ser His Asn Tyr Ser Phe 
1090 1095 1100 



3434 



tea att gtt gat atg ata ttt tta aat cct tta gat aaa aat atg ata 
Ser He Val Asp Met He Phe Leu Asn Pro Leu Asp Lys Asn Met He 
1105 1110 1115 



3482 
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gat cat gta ata aaa caa aat aaa cat caa tat tta att act tat gaa 3530 
Asp His Val He Lys Gin Asn Lys His Gin Tyr Leu He Thr Tyr Glu 
1120 1125 1130 1135 

gat aat act ata ggt ggt ttt tct aca cat ttc aat aat tat tta ata 3578 
*Asp Asn -Thr lie Gly 'Gly -Phe- : Ser Thr* Hi'S 'Phe-Asn Asn Tyr Leu lie 
1140 1145 1150 

gaa aat aat tat att aca aaa cat aac tta tat gtt cat aat att tat 3626 
Glu Asn Asn Tyr He Thr Lys His Asn Leu Tyr Val His Asn He Tyr 
1155 1160 1165 

tta tct aat gag cca att gaa cat gca tct ttt aag gat caa caa gaa '3674 
Leu Ser Asn Glu Pro He Glu His Ala Ser Phe Lys Asp Gin Gin Glu 
1170 1175 1180 

gtc gtc aaa atg gat aaa tgt agt ctt gtc aat aga att aaa aat tat 3722 
Val Val Lys Met Asp Lys Cys Ser Leu Val Asn Arg He Lys Asn Tyr 
1185 1190 1195 

ctt aaa aat aat cct aca tgatgtaaga taaatatata tttctaaaat 3770 
Leu Lys Asn Asn Pro Thr 
1200 1205 

tatttttttt ttatacttta atgtgtacaa taaaatatat atctaaatat attttatttg 3830 
tacgcttttt tttttttttt tttaattgtt atttttgtat at 3872 

<210> 4 
<211> 1205 
<212> PRT 

<213> Plasmodium* falciparum 
<400> 4 

Met He Phe Asn Tyr Val Phe Phe Lys Asn Phe Val Pro Val Val Leu 
1 5 10 15 

-Tyr lie Leu Leu lie lie Tyr He Asn Leu Asn Gly Met Asn Asn Lys 
20 25 30 

Asn Gin He Lys Thr Glu Lys He Tyr He Lys Lys Leu Asn Arg Leu 
35 40 45 

Ser Arg Lys Asn Ser Leu Cys. Ser Ser Lys Asn Lys He Ala Cys Leu 
50 55 60 

Phe Asp He Gly Asn Asp Asp Asn Arg Asn Thr Thr Tyr Gly Tyr Asn 
65 70 75 80 

Val Asn Val Lys Asn Asp Asp He Asn Ser Leu Leu Lys Asn Asn Tyr 
85 90 95 

Ser Asn Lys Leu Tyr Met Asp Lys Arg Lys Asn He Asn Asn Val He 
100 105 110 



Ser Thr Asn Lys lie Ser Gly Ser He Ser Asn He Cys Ser Arg Asn 
115 120 125 
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Gln Lys Glu Asn Glu Gin Lys Arg Asn Lys Gin Arg Cys Leu Thr Gin 
130 135 140 

Cys His Thr Tyr Asn Met Ser His Glu Gin Asp Lys Leu Ala Asn Asp 
145 15^0 155 160 

Asn Asn Arg Asn Asn Lys Lys Asn Phe Asn Leu Leu Phe lie Asn Tyr 
165 ■ 170 175 

Phe Asn Leu Lys Arg Met Lys Asn Ser Leu Leu Asn Lys Asp Asn Phe 
180 185 190 

Phe Tyr Cys Lys Glu Lys Lys Leu Ser Phe Leu His Lys Ala Tyr Lys 
195 200 205 

Lys Lys Asn Cys Thr Phe Gin Asn Tyr Ser Leu Lys Arg Lys Ser Asn 
210 215 220 

Arg Asp Ser His Lys Leu Phe Ser Gly Glu Phe Asp Asp Tyr Thr Asn 
225 230 235 240 

Asn Asn Ala Leu Tyr Glu Ser Glu Lys Lys Glu Tyr lie Thr Leu Asn 
245 250 255 

Asn Asn Asn -Lys Asn Asn Asn Asn Lys Asn Asn Asp Asn Lys Asn Asn 
260 265 270 

Asp Asn Asn Asp Tyr Asn Asn Asn Asn Ser Cys Asn Asn Leu Gly Glu 
275 280 285 

Arg Ser Asn His Tyr Asp Asn Tyr Gly Gly Asp Asn Asn Asn Pro Cys 
290 295 300 

Asn Asn Asn Asn Asp Lys Tyr Asp lie Gly Lys Tyr Phe Lys Gin lie 
305 310 315 320 

Asn Thr Phe lie Asn lie Asp Glu Tyr Lys Thr lie Tyr Gly Asp Glu 
325 330 335 

lie Tyr Lys Glu lie Tyr Glu Leu Tyr Val Glu Arg Asn lie Pro Glu 
340 345 350 

Tyr Tyr Glu Arg Lys Tyr Phe Ser Glu Asp He Lys Lys Ser Val Leu 
355 360 365 

Phe Asp He Asp Lys Tyr Asn Asp Val Glu Phe Glu Lys Ala He Lys 
370 375 380 

Glu Glu Phe He Asn Asn Gly Val Tyr He Asn Asn He Asp Asn Thr 
385 390 395 400 

Tyr Tyr Lys Lys Glu Asn He Leu He Met Lys Lys He Leu His Tyr 
405 410 415 

Phe Pro Leu Leu Lys Leu He Asn Asn Pro Ser Asp Leu Lys Lys Leu 
420 425 430 
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Lys Lys Gin Tyr Leu Pro Leu Leu Ala His Glu Leu Lys lie Phe Leu 
435 440 445 

Phe Phe lie Val Asn lie Thr Gly Gly His Phe Ser Ser Val Leu Ser 
450 455 460 

Ser Leu Glu lie Gin Leu Leu Leu Leu Tyr lie Phe Asn Gin Pro Tyr 
465 470 475 480 

Asp Asn Val He Tyr Asp lie Gly His Gin Ala Tyr Val His Lys He 
485 490 495 

"Leu Thr-'Gly Arg Xys Leu Leu Phe Leu Ser Leu Arg Asn Lys Lys Gly 
500 505 510 

He Ser Gly Phe Leu Asn He Phe Glu Ser He Tyr Asp Lys Phe Gly 
515 520 525 

Ala Gly His Ser Ser Thr Ser Leu Ser Ala He Gin Gly Tyr Tyr Glu 
530 535 540 

Ala Glu Trp Gin Val Lys Asn Lys Glu Lys Tyr Gly Asn Gly Asp He 
545 550 555 560 

Glu He Ser Asp Asn Ala Asn Val Thr Asn Asn Glu Arg He Phe Gin 
565 '• 570 575 

Lys Gly He His Asn Asp Asn Asn He Asn Asn Asn He Asn Asn Asn 
580 585 590 

Asn Tyr He Asn Pro Ser Asp Val Val Gly Arg Glu Asn Thr Asn Val 
595 600 605 

Pro Asn Val Arg Asn Asp Asn His Asn Val Asp Lys Val His He Ala 
610 615 - 620 

He He Gly Asp Gly Gly Leu Thr Gly Gly Met Ala Leu Glu Ala Leu 
625 630 635 640 

Asn Tyr He Ser Phe Leu Asn Ser Lys He Leu He He Tyr Asn Asp 
645 650 655 

Asn Gly Gin Val Ser Leu Pro Thr Asn Ala Val Ser He Ser Gly Asn 
660 665 670 

Arg Pro lie Gly Ser He Ser Asp His Leu His Tyr Phe Val Ser Asn 
675 680 685 

He Glu Ala Asn Ala Gly Asp Asn Lys Leu Ser Lys Asn Ala Lys Glu 
690 695 700 

Asn Asn He Phe Glu Asn Leu Asn Tyr Asp Tyr He Gly Val Val Asn 
705 710 715 720 

Gly Asn Asn Thr Glu Glu Leu Phe Lys Val Leu Asn Asn lie Lys Glu 
725 730 735 
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Asn Lys Leu Lys Arg Ala Thr Val Leu His Val Arg Thr Lys Lys S r 
740 745 750 

Asn Asp Phe lie Asn Ser Lys Ser Pro lie Ser lie Leu His Ser lie 
755 760 765 

Lys Lys Asn Glu lie Phe Pro Phe Asp Thr Thr lie Leu Asn Gly Asn 
770 775 780 

lie His Lys Glu Asn Lys lie Glu Glu Glu Lys Asn Val Ser Ser Ser 
785 790 795 800 

Thr Lys Tyr Asp Val Asn Asn Lys Asn Asn Lys Asn Asn Asp Asn Ser 
805 810 815 

Glu lie lie Lys Tyr Glu Asp Met Phe Ser Lys Glu Thr Phe Thr Asp 
820 825 830 

He Tyr Thr Asn Glu Met Leu Lys Tyr Leu Lys Lys Asp Arg Asn He 
835 840 845 

He Phe Leu Ser Pro Ala Met Leu Gly Gly Ser Gly Leu Val Lys He 
850 855 860 

Ser Glu Arg Tyr Pro Asn Asn Val Tyr Asp Val Gly He Ala Glu Gin 
865 870 875 880 

His Ser Val Thr Phe Ala Ala Ala Met Ala Met Asn Lys Lys Leu Lys 
885 890 895 

He Gin Leu Cys He Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp Gin 
900 905 .910 

He He His Asp Leu Asn Leu Gin Asn He Pro Leu Lys Val He He 
915 920 925 

Gly Arg Ser Gly Leu Val Gly Glu Asp Gly Ala Thr His Gin Gly He 
930 935 940 

Tyr Asp Leu Ser Tyr Leu Gly Thr Leu Asn Asn Ala Tyr He He Ser 
945 950 955 960 

Pro Ser Asn Gin Val Asp Leu Lys Arg Ala Leu Arg Phe Ala Tyr Leu 
965 970 975 

Asp Lys Asp His Ser Val Tyr He Arg He Pro Arg Met Asn He Leu 
980 985 990 

Ser Asp Lys Tyr Met Lys Gly Tyr Leu Asn He. His Met Lys Asn Glu 
995 1000 1005 

Ser Lys Asn He Asp Val Asn Val Asp He Asn Asp Asp Val Asp Lys 
1010 1015 1020 

Tyr Ser Glu Glu Tyr Met Asp Asp Asp Asn Phe He Lys Ser Phe He 
025 1030 1035 1040 
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Gly Lys Ser Arg He He Lys Met Asp Asn Glu Asn Asn Asn Thr Asn 
1045 1050 1055 

Glu His Tyr Ser Ser Arg Gly Asp Thr Gin Thr Lys Lys Lys Lys Val 
1060 1065 1070 

Cys He Phe Asn Met Gly Ser Met Leu Phe Asn Val He Asn Ala He 
1075 1080 1085 

Lys Glu He Glu Lys Glu Gin Tyr He Ser His Asn Tyr Ser Phe Ser 
1090 1095 1100 

He Val Asp Met lie Phe Leu Asn Pro Leu Asp Lys Asn Met He Asp 
105 1110 1115 1120 

His Val He Lys Gin Asn Lys His Gin Tyr Leu He Thr Tyr Glu Asp 
1125 1130 1135 

Asn Thr He Gly Gly Phe Ser Thr His Phe Asn Asn Tyr Leu He Glu 
1140 1145 1150 

Asn Asn Tyr He Thr Lys His Asn Leu Tyr Val His Asn He Tyr Leu 
1155 1160 1165 

Ser Asn Glu Pro He Glu His Ala Ser Phe Lys Asp Gin Gin Glu Val 
1170 1175 1180 

Val Lys Met Asp Lys Cys Ser Leu Val Asn Arg He Lys Asn Tyr Leu 
185 1190 1195 1200 

Lys Asn Asn Pro Thr 
1205 



<210> 5 
<211> 3147 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
<221> CDS 

<222> (199) (2670) 
<400> 5 

tttcattttt ctttacccac atatatatat atatatatat aatatatata tataatatta 60 

tatatttgat atatgattta aaattgtaac ataaaaaaaa taattatatt aaatatgtgt 120 

atacatctcc aacatataaa tattattttt tattattatt tttttttttt tttttcataa 180 

tgcctgaata accacaaa atg agt tat ata aaa aga ctg att ctt ttt atg 231 
Met Ser Tyr He Lys Arg Leu He Leu Phe Met 
1 5 10 

tta ctg ttt tat tct cat gta aaa att aaa aaa tta ttt att aaa att 279 
L u Leu Ph Tyr Ser His Val Lys He Lys Lys Leu Ph He Lys He 
15 20 25 
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tct aat gta aac ata ttt ttt gca gaa gca aag aaa aat gga aaa aag 327 
Ser Asn Val Asn He Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys 
30 35 40 

gaa ■ ttc ttt "Ctt -ttt tt a eta .aat ata aaa aaa aat age caa cag aaa 375 
Glu Phe Phe Leu Phe Leu Leu Asn He Lys Lys Asn Ser Gin Gin Lys 
45 50 55 

aaa act tat cat att acc aaa agg aat acc ata aat aaa agt gat ttt 423 
Lys Thr Tyr His He Thr Lys Arg Asn Thr He Asn Lys Ser Asp Phe 
60 65 70 75 

tta tat tct tta eta aat gaa gaa ggg aat tct tea aaa aag gaa tat 471 
Leu Tyr Ser Leu Leu Asn Glu Glu Gly Asn Ser Ser Lys Lys Glu Tyr 
80 85 90 

aaa aat tta aaa gat gaa gaa aaa tat aat ate ata caa aat ata aaa 519 
Lys Asn Leu Lys Asp Glu Glu Lys Tyr Asn He He Gin Asn He Lys 
95 100 105 

aaa tat tgt gaa tgt act aaa aaa tat aaa agg etc cca aca cga gaa 567 
Lys Tyr Cys Glu Cys Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu 
110 115 120 

gta gtt att gga aat gtt aaa att gga gga aat aat aaa ata get att 615 
Val Val He Gly Asn Val Lys He Gly Gly Asn Asn Lys He Ala He 
125 130 135 

caa act atg get age tgt gat aca aga aat gta gaa gaa tgt gta tat 663 
Gin Thr Met Ala Ser Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr 
140 145 150 155 

caa att aga aaa tgt aaa gat ttg ggt get gac att gta agg ttg act 711 
Gin He Arg Lys Cys Lys Asp Leu Gly Ala Asp He Val Arg Leu Thr 
160 165 170 

gtt caa gga gtt caa gaa gca caa get agt tat cat att aaa gaa aaa 759 
Val -Glu Gly Val^Gln Glu Ala <31n Ala Ser Tyr His He Lys Glu Lys 
175 180 185 

tta tta tct gaa aat gta aat ate cca tta gta gca gat att cat ttt 807 
Leu Leu Ser Glu Asn Val Asn He Pro Leu Val Ala Asp He His Phe 
190 195 200 

aat cct aaa ata get tta atg gca get gat gtg ttt gaa aaa att cga 855 
Asn Pro Lys He Ala Leu Met Ala Ala Asp Val Phe Glu Lys lie Arg 
205 210 215 

gtg aat cca gga aat tat gtt gat gga aga aaa aaa tgg ata gat aaa 903 
Val Asn Pro Gly Asn Tyr Val Asp Gly Arg Lys Lys Trp He Asp Lys 
220 225 230 235 

gtt tat aaa aat aaa gaa gaa ttt gat gaa ggg aaa tta ttt ata aaa 951 
Val Tyr Lys Thr Lys Glu Glu Phe Asp Glu Gly Lys Leu Phe lie Lys 
240 245 250 
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gaa aaa ttt gta cca tta att gaa aaa tgt aaa aga tta aat aga gca 999 
Glu Lys Phe Val Pro Leu II Glu Lys Cys Lys Arg Leu Asn Arg Ala 

255 260 265 

ata aga att gga aca aat cat gga tec ctt tea tct cga gta tta tea 1047 
Tie Arg Tie X3iy Thr Asn His Gly Ser Leu Ser Ser Arg Val Leu Ser 
270 275 280 

tat tat gga gat aca cca tta ggt atg gta gaa teg get ttt gag ttt 1095 

Tyr Tyr Gly Asp Thr Pro Leu Gly Met Val Glu Ser Ala Phe Glu Phe 
285 290 295 

tct gat tta tgt att gaa aac aat ttt tac aat ctt gtt ttt tct atg 1143 

Ser Asp Leu Cys lie Glu Asn Asn Phe Tyr Asn Leu Val Phe Ser Met 
300 305 310 315 

aaa get tct aat get tat gtt atg ata caa tct tat aga tta tta gta 1191 

Lys Ala Ser Asn Ala Tyr Val Met lie Gin Ser Tyr Arg Leu Leu Val 
320 325 330 

tct aaa caa tat gaa aga aat atg atg ttc cct ata cat tta gga gtt 1239 

Ser Lys Gin Tyr Glu Arg Asn Met Met Phe Pro lie His Leu Gly Val 

335 340 345 

aca gaa gca gga ttt ggt gat aat gga aga ata aaa tct tat tta ggt 1287 

Thr Glu Ala Gly Phe Gly Asp Asn Gly Arg lie Lys Ser Tyr Leu Gly 
350 355 360 

ata gga tct tta tta tat gat ggt ata gga gat acc att cgt ata tec 1335 

lie Gly Ser Leu Leu Tyr Asp Gly lie Gly Asp Thr He Arg He Ser 
365 370 375 

tta aca gaa gat cct tgg gaa gag tta act cct tgt aaa aaa tta gtt 1383 

Leu Thr Glu Asp Pro Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val 
380 385 390 395 

gaa aat tta aag aaa aga ata ttt tat aat gaa aat ttt aaa gaa gat 1431 

Glu Asn Leu Lys Lys Arg He Phe Tyr Asn Glu Asn Phe Lys Glu Asp 
400 405 410 

aat gaa tta aaa aat aat gaa atg gat acc aaa aat eta tta aat ttt 1479 

Asn Glu Leu Lys Asn Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe 

415 420 425 

gaa gaa aat tat cga aat ttt aat aat ata aaa aaa aga aat gta gaa 1527 

Glu Glu Asn Tyr Arg Asn Phe Asn Asn He Lys Lys Arg Asn Val Glu 
430 435 440 

aaa aat aat aat gta tta cat gaa gag tgc act ata ggt aat gta gta 1575 

Lys Asn Asn Asn Val Leu His Glu Glu Cys Thr He Gly Asn Val Val 
445 450 455 

acc ata aaa gag tta gaa gat tct ctg caa att ttt aaa gat tta aat 1623 

Thr He Lys Glu Leu Glu Asp Ser Leu Gin He Phe Lys Asp Leu Asn 
460 465 470 475 
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tta gaa gta gat tea aat gga aat ttg aaa aag gga gec aaa aca act 1671 
Leu Glu Val Asp Ser Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr 
480 485 490 

gat atg gtt att ata aat gat ttt cat aat ata aca aat tta gga aaa 1719 
Asp Met Val He lie Asn Asp Phe His Asn He Thr Asn Xeu Gly "Lys 
495 500 505 

aaa act gtg gat aaa tta atg caa gtg gga att aat ata gta gtt caa 1767 
Lys Thr Val Asp Lys Leu Met Gin Val Gly He Asn He Val Val Gin 
510 515 520 

tat gaa cca cat aat ata gaa ttt ata gaa aaa atg gaa cca aat aat 1815 
Tyr Glu Pro His Asn He Glu Phe He Glu Lys Met Glu Pro Asn Asn 
525 530 535 

gat aat aat aat aat aat aat aat aat aat ata tta ttt tat gtg gat 1863 
Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn lie Leu Phe Tyr Val Asp 
540 545 550 555 

ata aaa aat att atg aac agt tea gaa aaa aat att aaa tta agt aat 1911 
He Lys Asn He Met Asn Ser Ser Glu Lys Asn He Lys Leu Ser Asn 
560 565 570 

tct aaa gga tat gga tta att tta aac gga aaa gaa gat ata caa acc 1959 
Ser Lys Gly Tyr Gly Leu He Leu Asn Gly Lys Glu Asp He Gin Thr 
575 580 585 

ata aaa aaa ata aaa gaa tta aat cgt cgt cct tta ttc att eta tta 2007 
He Lys Lys He Lys Glu Leu Asn Arg Arg Pro Leu Phe He Leu Leu 
590 595 600 

aaa tea gat aac ata tat gaa cat gta tta ata acc aga aga att aat 2055 
Lys Ser Asp Asn He Tyr Glu His Val Leu He Thr Arg Arg He Asn 
605 610 615 

gaa ctt tta caa tec tta aat ata aat ata cct tat ata cat tat gtt 2103 
Glu Leu Leu Gin Ser Leu Asn He Asn He Pro Tyr lie His Tyr Val 
620 625 630 635 

gat att aat tea aac aat tat gat gat ata tta gtt aat tea aca tta 2151 
Asp He Asn Ser Asn Asn Tyr Asp Asp He Leu Val Asn Ser Thr Leu 
640 645 650 

tat gca gga agt tgt ttg atg gat tta atg ggg gat ggt ctt att gtt 2199 
Tyr Ala Gly Ser Cys Leu Met Asp Leu Met Gly Asp Gly Leu He Val 
655 660 665 

aac gta act aat gat gtt ctt aca aat aaa aaa aag ata gaa aca aaa 2247 
Asn Val Thr Asn Asp Val Leu Thr Asn Lys Lys Lys lie Glu Thr Lys 
670 675 680 

tat gat gaa aaa gaa gaa gta gag gaa gag gga aac aat aaa gat att 2295 
Tyr Asp Glu Lys Glu Glu Val Glu Glu Glu Gly Asn Asn Lys Asp lie 
685 690 695 
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cat aga ctt ttg age aga gtt gca tta aat tea ttt tta aca tta aat 2343 
His Arg Leu Leu Ser Arg Val Ala L u Asn Ser Phe Leu Thr Leu Asn 
700 705 710 715 

att tta caa gat aca aga ata cgt tta ttt aaa aca gat tat ata gec 2391 
lie Leu 'Gin "Asp Thr Arg lie Arg Leu Phe Lys Thr Asp Tyr lie Ala 
720 725 730 

tgc cca tct tgt gga aga act tta ttt aat ata caa gaa act act aaa 2439 
Cys Pro Ser Cys Gly Arg Thr Leu Phe Asn lie Gin Glu Thr Thr Lys 
735 740 745 

aaa att atg aaa tta aca ggg cac tta aaa ggc gtt aaa att gca gtc 2487 
Lys He Met Lys Leu Thr Gly His Leu Lys Gly Val Lys He Ala Val 
750 755 760 

atg gga tgt att gtt aat ggt ata gga gaa atg gca gat gca cat ttt 2535 
Met Gly Cys He Val Asn Gly He Gly Glu Met Ala Asp Ala His Phe 
765 770 775 

ggt tat gtt ggt agt gca cct aaa aaa att gat tta tat tat ggt aaa 2583 
Gly Tyr Val Gly Ser Ala Pro Lys Lys He Asp Leu Tyr Tyr Gly Lys 
780 785 790 795 

gag tta gta gaa aga aat ata cct gag gaa gaa get tgt gat aaa ttg 2631 
Glu Leu Val Glu Arg Asn He Pro Glu Glu Glu Ala Cys Asp Lys Leu 
800 805 810 

ata gaa tta att aaa aaa cat aac aaa tgg aaa gat cca taaattgaat 2660 
He Glu Leu He Lys Lys His Asn Lys Trp Lys Asp Pro 
815 820 



atggacaagt 


atttatttat 


ttatttatct 


tatatataat atattataaa 


tttttcgatg 


2740 


tatttttcct 


tttaaaattt 


tatttttttt 


ttattttttt ttttgaagta 


atatttataa 


2800 


tgcatacata 


atattaaaat 


gtgtattata 


taataatatc attttattgt 


tattttaaaa 


2860 


gactaatacc 


aagaacaatt 


ttttaataat 


cattcttata acttgttaaa tatatatata 


2920 


tatatatata 


tatttattta 


tttatattta 


tatttattta tttttggtat 


atgaaaagta 


2980 


aaaatataat 


aatttaaaag 


tatttacaaa 


ataaataata ttatatatct 


gtttttatat 


3040 


atatgttaat 


ggaaaaggag 


aaaataaata 


aataaaacaa acaaaataac 


atatatatat 


3100 


atatatatat 


actgaatgag 


aaagaaaaaa 


aaaagaaaag gatacga 




3147 



<210> 6 
<211> 824 
<212> PRT 

<213> Plasmodium falciparum 
<400> 6 

Met Ser Tyr He Lys Arg Leu He L u Ph Met Leu Leu Phe Tyr Ser 
15 10 15 
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His Val Lys He Lys Lys Leu Phe He Lys He Ser Asn Val Asn lie 
20 25 30 

Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys Glu Phe Phe Leu Phe 
35 40 45 

Leu Leu Asn He Lys Lys Asn Ser Gin Gin Lys Lys Thr Tyr His He 
50 55 60 

Thr Lys Arg Asn Thr lie Asn Lys Ser Asp Phe Leu Tyr Ser Leu Leu 
65 70 75 80 

Asn Glu Glu Gly Asn Ser Ser liys Xys Glu Tyr Lys Asn Leu Lys Asp 
85 90 95 

Glu Glu Lys Tyr Asn He He Gin Asn He Lys Lys Tyr Cys Glu Cys 
100 105 110 

Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu Val Val He Gly Asn 
115 120 125 

Val Lys He Gly Gly Asn Asn Lys He Ala He Gin Thr Met Ala Ser 
130 135 140 

Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr Gin He Arg Lys Cys 
145 150 155 160 

Lys Asp Leu Gly Ala Asp He Val Arg Leu Thr Val Gin Gly Val Gin 
165 170 175 

Glu Ala Gin Ala Ser Tyr His He Lys Glu Lys Leu Leu Ser Glu Asn 
180 185 190 

Val Asn He Pro Leu Val Ala Asp He His Phe Asn Pro Lys He Ala 
195 200 205 

Leu Met Ala Ala Asp Val Phe Glu Lys He Arg Val Asn Pro Gly Asn 
210 215 220 

Tyr Val Asp Gly Arg Lys Lys Trp He Asp Lys Val Tyr Lys Thr Lys 
225 230 235 240 

Glu Glu Phe Asp Glu Gly Lys Leu Phe He Lys Glu Lys Phe Val Pro 
245 250 255 

L u He Glu Lys Cys Lys Arg Leu Asn Arg Ala He Arg He Gly Thr 
260 265 270 

Asn His Gly Ser Leu Ser Ser Arg Val Leu Ser Tyr Tyr Gly Asp Thr 
275 280 285 

Pro Leu Gly Met Val Glu Ser Ala Phe Glu Phe Ser Asp Leu Cys lie 
290 295 300 

Glu Asn Asn Phe Tyr Asn Leu Val Phe Ser Met Lys Ala Ser Asn Ala 
305 310 315 320 

Tyr Val Met lie Gin Ser Tyr Arg Leu Leu Val Ser Lys Gin Tyr Glu 
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325 330 335 

Arg Asn Met Met Phe Pro lie His Leu Gly Val Thr Glu Ala Gly Phe 
340 345 350 

X3iy 'Asp %sn Gly Arg lie Lys Ser Tyr Leu Gly He Gly Ser Leu Leu 
355 360 365 

Tyr Asp Gly lie Gly Asp Thr He Arg He Ser Leu Thr Glu Asp Pro 
370 375 380 

Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val Glu Asn Leu Lys Lys 
385 390 395 4 00 

Arg He Phe Tyr Asn Glu Asn Phe Lys Glu Asp Asn Glu Leu Lys Asn 
405 410 415 

Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe Glu Glu Asn Tyr Arg 
420 425 430 

Asn Phe Asn Asn He Lys Lys Arg Asn Val Glu Lys Asn Asn Asn Val 
435 440 445 

Leu His Glu Glu Cys Thr He Gly Asn Val Val Thr He Lys Glu Leu 
450 455 460 

Glu Asp Ser Leu Gin He Phe Lys Asp Leu Asn Leu Glu Val Asp Ser 
465 470 475 480 

Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr Asp Met Val He He 
485 490 495 

Asn Asp Phe His Asn lie Thr Asn Leu Gly Lys Lys Thr Val Asp Lys 
500 505 510 

Leu Met Gin Val Gly He Asn He Val Val Gin Tyr Glu Pro His Asn 
515 520 525 

He Glu Ph« lie Glu Lys Met -Glu Pro Asn Asn Asp Asn Asn Asn Asn 
530 5.35 540 

Asn Asn Asn Asn Asn He Leu Phe Tyr Val Asp He Lys Asn lie Met 
545 550 555 560 

Asn Ser Ser Glu Lys Asn He Lys Leu Ser Asn Ser Lys Gly Tyr Gly 
565 570 575 

Leu He Leu Asn Gly Lys Glu Asp He Gin Thr He Lys Lys He Lys 
580 585 590 

Glu Leu Asn Arg Arg Pro Leu Phe He Leu Leu Lys Ser Asp Asn He 
595 600 605 

Tyr Glu His Val Leu He Thr Arg Arg lie Asn Glu Leu Leu Gin Ser 
610 615 620 

Leu Asn He Asn lie Pro Tyr He His Tyr Val Asp lie Asn Ser Asn 
625 630 635 640 



CA 02343919 2001-03-22 



WO 00/17233 PCT/EP99/07055 

-22- 



Asn Tyr Asp Asp lie Leu Val Asn Ser Thr Leu Tyr Ala Gly Ser Cys 
645 650 655 

Leu Met Asp Leu Met Gly Asp Gly Leu lie Val Asn Val Thr Asn Asp 
660 -665 670 

Val Leu Thr Asn Lys Lys Lys lie Glu Thr Lys Tyr Asp Glu Lys Glu 
675 680 685 

Glu Val Glu Glu Glu Gly Asn Asn Lys Asp lie His Arg Leu Leu Ser 
690 695 700 

Arg Val Ala Leu Asn Ser Phe Leu Thr Leu Asn lie Leu Gin Asp Thr 
705 710 715 720 

Arg lie Arg Leu Phe Lys Thr Asp Tyr lie Ala Cys Pro Ser Cys Gly 
725 730 735 

Arg Thr Leu Phe Asn lie Gin Glu Thr Thr Lys Lys lie Met Lys Leu 
740 745 750 

Thr Gly His Leu Lys Gly Val Lys He Ala Val Met Gly Cys He Val 
755 760 765 

Asn Gly He Gly Glu Met Ala Asp Ala His Phe Gly Tyr Val Gly Ser 
770 775 780 

Ala Pro Lys Lys He Asp Leu Tyr Tyr Gly Lys Glu Leu Val Glu . Arg 
785 790 795 800 

Asn He Pro Glu Glu Glu Ala Cys Asp Lys Leu He Glu Leu He Lys 
805 810 815 

Lys His Asn Lys Trp Lys Asp Pro 
820 



